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AF165281 9497 bp mRNA linear PRI 17-AUG-1999 

Homo sapiens ATP cassette binding transporter 1 (ABC1) mRNA, 
complete cds . 
AF165281 

AF165281.1 GI:5734100 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 9497) 

Rust ,5., Rosier , M . , Funke,H., Real, J., Amoura,Z., Piette f J.C, 

Deleuze, J.F. , Brewer, H.B., Duverger,N., Denef le, P and Assmann, G . 

Tangier disease is caused by mutations in the gene encoding 

ATP-binding cassette transporter 1 

Nat. Genet. 22 (4), 352-355 (1999) 

99364413 

10431238 

2 (bases 1 to 9497) 

Rust.S., Rosier, M . , Funke,H., Real, J., 
Deleuze, J.F. , Brewer, H.B., Duverger,N. 
Direct Submission 

Submitted ( 01 -JUL- 1999 ) Genomics, Rhone- Poulenc Rorer, 
Cr#mieux, Evry 91006, France 

Location/Qualifiers 

1. .9497 

/organism- "Homo sapiens" 
/mol_type="mRNA" 



Amoura,Z., Piette,J.C, 
, Denef le, P. and Assmann, 



2 rue Gaston 



/db_xre f = 11 1 axon : 9 6 0 6 » 

/chromosome = 11 9 " 

/map=" 9q31" 
gene 1. .9497 

/gene="ABCl 11 
CDS 121. .6726 

/gene="ABCl 11 

/note="ABC transporter; ABC1 protein" 
/codon_start=l 

/product="ATP cassette binding transporter l" 
/protein_id="AAD49849 .1" 
/db_xref ="GI : 5 73 4101" 

/ 1 ran si at ion= "MPSAGTLPWVQGIICNANNPCFRYPTPGEAPGWGNFNKSIVAR 

LFSDARRLLLYSQKDTSMKDMRKVLRTLQQIKKSSSNLKLQDFLVDNETFSGFLYHNL 

SLPKSTVDKMLRADVILHKVFLQGYQLHLTSLCNGSKSEEMIQLGDQEVSELCGLPRE 

KLAAAERVLRSNMDILKPILRTLNSTSPFPSKELAEATKTLLHSLGTLAQELFSMRSW 

SDMRQEVMFLTNVNSSSSSTQIYQAVSRIVCGHPEGGGLKIKSLNWYEDNNYKALFGG 

NGTEEDAETFYDNSTTP YCNDLMKNLESS PLSRI I WKALKPLLVGKI LYTPDTPATRQ 

VMAE VN KT F Q E LAV F KD L EGMWEE LSPKI WT FM EN S QE MD L VRM L LD S RDND H F WE QQ 

LDGLDWTAQD I VAFLAKHPEDVQS SNGS VYTWREAFNETNQA I RT I SRFMECVNLNKL 

EPIATEVWLINKSMELLDERKFWAGIVFTGITPGSIELPHHVKYKIRMDIDNVERTNK 

IKDGYWDPGPRADPFEDMRYVWGGFAYLQDWEQAIIRVLTGTEKKTGVYMQQMPYPC 

YVDDIFLRVMSRSMPLFMTLAWIYSVAVIIKGIVYEKEARLKETMRIMGLDNSILWFS 

WFISSLIPLLVSAGLLWILKLGNLLPYSDPSWFVFLSVFAWTILQCFLISTLFSR 

ANLAAACGG I I YFTLYLPYVLCVAWQD YVGFTLKI FASLLS PVAFGFGCE YFALFEEQ 

GIGVQWDNLFESPVEEDGFNLTTSVSMMLFDTFLYGVMTWYIEAVFPGQYGIPRPWYF 

PCTKSYWFGEESDEKSHPGSNQKRISEICMEEEPTHLKLGVSIQNLVKVYRDGMKVAV 

DGLALNFYEGQITSFLGHNGAGKTTTMSILTGLFPPTSGTAYILGKDIRSEMSTIRQN 

LGVCPQHNVLFDMLTVEEHIWFYARLKGLSEKHVKAEMEQMALDVGLPSSKLKSKTSQ 

LSGGMQRKLSVALAFVGGSKWILDEPTAGVDPYSRRGIWELLLKYRQGRTIILSTHH 

MDEADVLGDRIAIISHGKLCCVGSSLFLKNQLGTGYYLTLVKKDVESSLSSCRNSSST 

VSYLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNLIRKHVSEARLVEDIGHELTY 

VLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETTLEEIFLKVAEESGVDAETSDG 

TLPARRNRRAFGDKQSCLRPFTEDDAADPNDSDIDPESRETDLLSGMDGKGSYQVKGW 

KLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSL1VPPFGKYPSLELQ 

PWMYNEQYTFVSNDAPEDTGTLELLNALTKDPGFGTRCMEGNPIPDTPCQAGEEEWTT 

APVPQTIMDLFQNGNWTMQNPSPACQCSSDKIKKMLPVCPPGAGGLPPPQRKQNTADI 

LQDLTGRNI SDYLVKTYVQI I AKSLKNKI WVNEFRYGGFSLGVSNTQALPPSQEVNDA 

TKQMKKHLKLAKDS S ADRFLNS LGRFMTGLDTRNNVKVWFNNKGWHAI S S FLNVINNA 

ILRANLQKGENPSHYGITAFNHPLNLTKQQLSEVAPMTTSVDVLVSICVIFAMSFVPA 

SFWFLIQERVSKAKHLQFISGVKPVIYWLSNFVWDMCNYWPATLVIIIFICFQQKS 

YVSSTNLPVLALLLLLYGWSITPLMYPASFVFKIPSTAYVVLTSVNLFIGINGSVATF 

VLELFTDNKLNNINDILKSVFLIFPHFCLGRGLIDMVKNQAMADALERFGENRFVSPL 

SWDLVGRNLFAMAVEGWFFLITVLIQYRFFIRPRPVNAKLSPLNDEDEDVRRERQRI 

LDGGGQNDILEIKELTKIYRRKRKPAVDRICVGIPPGECFGLLGVNGAGKSSTFKMLT 

GDTTVTRGDAFLNRNSILSNIHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPE 

KEVGKVGEWAIRKLGLVKYGEKYAGNYSGGNKRKLSTAMALIGGPPWFLDEPTTGMD 

PKARRFLWNCALSWKEGRSWLTSHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNR 

FGDGYTIWRIAGSNPDLKPVQDFFGLAFPGSVPKEKHRNMLQYQLPSSLSSLARIFS 

ILSQSKKRLHIEDYSVSQTTLDQVFVNFAKDQSDDDHLKDLSLHKNQTWDVAVLTSF 

LQDEKVKESYV" 

BASE COUNT 2600 a 2115 c 2217 g 2564 t 1 others 

ORIGIN 

Query Match 100.0%; Score 6879; DB 9; Length 9497; 

Best Local Similarity 100.0%; Pred. No. 0 ; 

Matches 6879; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
QY 1 CAAACATGTCAGCTGTTACTGGAAGTGGCCTGGCCTCTATTTATCTTCCTGATCCTGATC 6 0 

Db 1 CAAACATGTCAGCTGTTACTGGAAGTGGCCTGGCCTCTATTTATCTTCCTGATCCTGATC 6 0 

Qy 61 TCTGTTCGGCTGAGCTACCCACCCTATGAACAACATGAATGCCATTTTCCAAATAAAGCC 12 0 

IIIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIII 

Db 61 TCTGTTCGGCTGAGCTACCCACCCTATGAACAACATGAATGCCATTTTCCAAATAAAGCC 12 0 

Qy 121 ATGCCCTCTGCAGGAACACTTCCTTGGGTTCAGGGGATTATCTGTAATGCCAACAACCCC 180 

IIIIIIIIIIIIMIMIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMIllllll 

Db 121 ATGCCCTCTGCAGGAACACTTCCTTGGGTTCAGGGGATTATCTGTAATGCCAACAACCCC 180 

Qy 181 TGTTTCCGTTACCCGACTCCTGGGGAGGCTCCCGGAGTTGTTGGAAACTTTAACAAATCC 24 0 



181 



IIIMIIIIIIIIIIIIIIIIMIIMIIIIIIIIIIMMIIIIIIIIIIIIIIIMII 

TGTTTCCGTTACCCGACTCCTGGGGAGGCTCCCGGAGTTGTTGGAAACTTTAACAAATCC 



240 



241 ATTGTGGCTCGCCTGTTCTCAGATGCTCGGAGGCTTCTTTTATACAGCCAG AAAG ACACC 3 0 0 

IMIMMIIIIIIIIIIIIIIMIMIIIIIMIIIIIIIlMlllllliiiiiiiiii 

241 ATTGTGGCTCGCCTGTTCTCAGATGCTCGGAGGCTTCTTTTATACAGCC AGAAAGACACC 3 0 0 
3 01 AG C ATG AAGG AC ATG CGC AAAGT T C TG AG AA C AT T A C AG C AG AT C AAG AAAT C C AG CTC A 360 

IIIIMIIIIIIMIIIIIMIIIIIIIIIIIIIIIIMIMMMIIIIIIIMIIIII 

AG C ATG AAGG AC ATG CG C AAAGT T C TG AG AAC ATT AC AG C AG AT C AAG AAAT C C AG C TC A 



301 
361 



360 
420 



AACTTGAAGCTTCAAGATTTCCTGGTGGACAATGAAACCTTCTCTGGGTTCCTGTATCAC 
361 AACTTGAAGCTTCAAGATTTCCTGGTGGACAATGAAACCTTCTCTGGGTTCCTGTATCAC 4 20 
4 21 AACCTCTCTCTCCCAAAGTCTACTGTGGACAAGATGCTGAGGGCTGATGTCATTCTCCAC 4 80 
4 21 AACCTCTCTCTCCCAAAGTCTACTGTGGACAAGATGCTGAGGGCTGATGTCATTCTCCAC 4 80 
4 81 AAGGTATTTTTGCAAGGCTACCAGTTACATTTGACAAGTCTGTGCAATGGATCAAAATCA 54 0 
4 81 AAGG TAT T T T TG C AAGG C T AC C AG T T AC AT T TG AC AAGTC TG TG C AAT GG AT C AAAAT C A 54 0 
541 GAAGAGATGATTCAACTTGGTGACCAAGAAGTTTCTGAGCTTTGTGGCCTACCAAGGGAG 600 
541 GAAGAGATGATTCAACTTGGTGACCAAGAAGTTTCTGAGCTTTGTGGCCTACCAAGGGAG 600 
6 01 AAACTGGCTGCAGCAGAGCGAGTACTTCGTTCCAACATGGACATCCTGAAGCCAATCCTG 660 

IIMIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIII 

601 AAACTGGCTGCAGCAGAGCGAGTACTTCGTTCCAACATGGACATCCTGAAGCCAATCCTG 660 
661 AGAACACTAAACTCTACATCTCCCTTCCCGAGCAAGGAGCTGGCCGAAGCCACAAAAACA 72 0 
661 AG AAC AC TAAACT C T A CAT CT C C CTT C C CG AGC AA GG AG CTGG CCG AAG C C AC AAAAAC A 72 0 
721 TTGCTGCATAGTCTTGGGACTCTGGCCCAGGAGCTGTTCAGCATGAGAAGCTGGAGTGAC 7 8 0 
721 TTGCTGCATAGTCTTGGGACTCTGGCCCAGGAGCTGTTCAGCATGAGAAGCTGGAGTGAC 7 80 
781 ATGCGACAGGAGGTGATGTTTCTGACCAATGTGAACAGCTCCAGCTCCTCCACCCAAATC 84 0 
781 ATGCGACAGGAGGTGATGTTTCTGACCAATGTGAACAGCTCCAGCTCCTCCACCCAAATC 84 0 
841 TACCAGGCTGTGTCTCGTATTGTCTGCGGGCATCCCGAGGGAGGGGGGCTGAAGATCAAG 90 0 
841 TACCAGGCTGTGTCTCGTATTGTCTGCGGGCATCCCGAGGGAGGGGGGCTGAAGATCAAG 90 0 
901 TCTCTCAACTGGTATGAGGACAACAACTACAAAGCCCTCTTTGGAGGCAATGGCACTGAG 960 
901 TCTCTCAACTGGTATGAGGACAACAACTACAAAGCCCTCTTTGGAGGCAATGGCACTGAG 96 0 
961 GAAGATGCTGAAACCTTCTATGACAACTCTACAACTCCTTACTGCAATGATTTGATGAAG 102 0 
961 GAAGATGCTGAAACCTTCTATGACAACTCTACAACTCCTTACTGCAATGATTTGATGAAG 1020 
1021 AATTTGGAGTCTAGTCCTCTTTCCCGCATTATCTGGAAAGCTCTGAAGCCGCTGCTCGTT 1080 

IINMIIMIIIIIIIIIMIIIIIMIIIIIIIllllllllllMIMIIIIIIIIII 

1021 AATTTGGAGTCTAGTCCTCTTTCCCGCATTATCTGGAAAGCTCTGAAGCCGCTGCTCGTT 10 80 
10 81 GGGAAGATCCTGTATACACCTGACACTCCAGCCACAAGGCAGGTCATGGCTGAGGTGAAC 114 0 

HIM 

1081 GGGAAGATCCTGTATACACCTGACACTCCAGCCACAAGGCAGGTCATGGCTGAGGTOAAC 114 0 
1141 AAGACCTTCCAGGAACTGGCTGTGTTCCATGATCTGGAAGGCATGTGGGAGGAACTCAGC 12 00 

NIIIMIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIMIIIIIIIIMKIIIII 

1141 AAGACCTTCCAGGAACTGGCTGTGTTCCATGATCTGGAAGGCATGTGGGAGGAACTCAGC 12 00 
12 01 CCCAAGATCTGGACCTTCATGGAGAACAGCCAAGAAATGGACCTTGTCCGGATGCTGTTG 12 6 0 

iiiiMMiiiiiiiiiiiMiiiii miMiiiiiiiimmiiiimii 

12 01 CC CAAGATCTGGACCTTCATGGAG AAC AGCC AAGAAATGGAC CTTG PCCGG ATGCTGTTG 12 6 0 



12 61 GACAGCAGGGACAATGACCACTTTTGGGAACAGCAGTTGGATGGCTTAGATTGGACAGCC 132 0 
1261 GACAGCAGGGACAATGACCACTTTTGGGAACAGCAGTTGGATGGCTTAGATTGGACAGCC 132 0 

13 21 CAAGACATCGTGGCGTTTTTGGCCAAGCACCCAGAGGATGTCCAGTCCAGTAATGGTTCT 13 8 0 

MIIIMMIIIIIIIIIIIIIIIIIMIIIIMIIIMIINIIIIIIIIIIIIIIIII 

1321 CAAGACATCGTGGCGTTTTTGGCCAAGCACCCAGAGGATGTCCAGTCCAGTAATGGTTCT 13 8 0 
13 81 GTGTAC ACCTGGAGAG AAGCTTTCAACG AGACT AACC AGGCAATCCGGACC ATATCTCGC 14 4 0 
13 81 GTGT AC ACCTGGAGAGAAGCTTTCAACGAG ACT AACC AGGCAATCCGGACC ATATCTCGC 14 4 0 
1441 TTCATGGAGTGTGTCAACCTGAACAAGCTAGAACCCATAGCAACAGAAGTCTGGCTCATC 15 0 0 
1441 TTCATGGAGTGTGTCAACCTGAACAAGCTAGAACCCATAGCAACAGAAGTCTGGCTCATC 150 0 
1501 AACAAGTCCATGGAGCTGCTGGATGAGAGGAAGTTCTGGGCTGGTATTGTGTTCACTGGA 156 0 

IIIIIIIMIIIIMIIIIIIIMIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMII 



1561 ATT AC T C C AGG C AG C AT TG AG CTG C C C C AT C ATG T C AAG T AC AAG AT C CG AATGG AC ATT 162 0 

IMMIIIIIIIMIIIIMIIIIIIMIIIIIIIIIIIIIIIMIIIIMIMIIIIII 

1561 AT T AC T C C AGG C AG C AT TG AG CTG C C C C AT C ATG T C AAGT AC AAG AT C CG AATGG AC ATT 1620 

1621 GACAATGTGGAGAGGACAAATAAAATCAAGGATGGGTACTGGGACCCTGGTCCTCGAGCT 168 0 

1621 GACAATGTGGAGAGGACAAATAAAATCAAGGATGGGTACTGGGACCCTGGTCCTCGAGCT 168 0 

16 81 GACCCCTTTGAGGACATGCGGTACGTCTGGGGGGGCTTCGCCTACTTGCAGGATGTGGTG 174 0 

16 81 GACCCCTTTGAGGACATGCGGTACGTCTGGGGGGGCTTCGCCTACTTGCAGGATGTGGTG 174 0 

1741 GAGCAGGCAATCATCAGGGTGCTGACGGGCACCGAGAAGAAAACTGGTGTCTATATGCAA 18 0 0 

1741 GAGCAGGCAATCATCAGGGTGCTGACGGGCACCGAGAAGAAAACTGGTGTCTATATGCAA 18 0 0 

18 01 CAGATGCCCTATCCCTGTTACGTTGATGACATCTTTCTGCGGGTGATGAGCCGGTCAATG 186 0 

18 01 CAGATGCCCTATCCCTGTTACGTTGATGACATCTTTCTGCGGGTGATGAGCCGGTCAATG 186 0 

18 61 CCCCTCTTCATGACGCTGGCCTGGATTTACTCAGTGGCTGTGATCATCAAGGGCATCGTG 192 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I 
18 61 CCCCTCTTCATGACGCTGGCCTGGATTTACTCAGTGGCTGTGATCATCAAGGGCATCGTG 192 0 

1921 TATGAGAAGGAGGCACGGCTGAAAGAGACCATGCGGATCATGGGCCTGGACAACAGCATC 1980 

1921 TATGAGAAGGAGGCACGGCTGAAAGAGACCATGCGGATCATGGGCCTGGACAACAGCATC 198 0 

1981 CTCTGGTTTAGCTGGTTCATTAGTAGCCTCATTCCTCTTCTTGTGAGCGCTGGCCTGCTA 2 04 0 

HI!!! IIIIIIIMIIIMIIIIIIIIIIIIMIIIIMMMIIIMIMI 

1981 CTCTGGTTTAGCTGGTTCATTAGTAGCCTCATTCCTCTTCTTGTGAGCGCTGGCCTGCTA 2 04 0 
2 041 GTGGTCATCCTGAAGTTAGGAAACCTGCTGCCCTACAGTGATCCCAGCGTGGTGTTTGTC 210 0 

IIIIIIIIIMIIIIIIIIIIIIIIIMIIIIMIIMllllllliiMiiiiiiiiiii 

2 041 GTGGTCATCCTGAAGTTAGGAAACCTGCTGCCCTACAGTGATCCCAGCGTGGTGTTTGTC 210 0 
2101 TTCCTGTCCGTGTTTGCTGTGGTGACAATCCTGCAGTGCTTCCTGATTAGCACACTCTTC 216 0 

IIMIIIIIIIMIIIIIIMIIIIIII 

2101 TTCCTGTCCGTGTTTGCTGTGGTGACAATCCTGCAGTGCTTCCTGATTAGCACACTCTTC 2160 
2161 TCCAGAGCCAACCTGGCAGCAGCCTGTGGGGGCATCATCTACTTCACGCTGTACCTGCCC 2220 

MI!IIMIIIIIII!IIIMII!III!IIII!IIII!II|||||||||!|||||||||| 

2161 TCCAGAGCCAACCTGGCAGCAGCCTGTGGGGGCATCATCTACTTCACGCTGTACCTGCCC 2 22 0 
2 2 21 TACGTCCTGTGTGTGGCATGGCAGGACTACGTGGGCTTCACACTCAAGATCTTCGCTAGC 2 2 8 0 

NIIIIMIIMIIIMIIMIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI 

2 2 21 TACGTCCTGTGTGTGGCATGGCAGGACTACGTGGGCTTCACACTCAAGATCTTCGCTAGC 2 2 80 
22 81 CTGCTGTCTCCTGTGGCTTTTGGGTTTGGCTGTGAGTACTTTGCCCTTTTTGAGGAGCAG 2 34 0 

IIIIIIIIIIIIIIIIIIMIMIMIIIIIMIIIIIIIMIIIlllllllliiiiiH 

2 2 81 CTGCTGTCTCCTGTGGCTTTTGGGTTTGGCTGTGAGTACTTTGCCCTTTTTGAGGAGCAG 23 4 0 



2341 GGCATTGGAGTGCAGTGGGACAACCTGTTTGAGAGTCCTGTGGAGGAAGATGGCTTCAAT 24 00 

' M M IM II. M M : ll ,1 ,1 'I | 

2 341 GGCATTGGAGTGCAGTGGGACAACCTGTTTGAGAGTCCTGTGGAGGAAGATGGCTTCAAT 24 00 
2 4 01 CTCACCACTTCGGTCTCCATGATGCTGTTTGACACCTTCCTCTATGGGGTGATGACCTGG 24 6 0 

MM 

2-101 CTCACCACTTCGGTCTCCATGATGCTGTTTGACACCTTCCTCTATGGGGTGATGACCTGG 2460 
2 4 61 TACATTGAGGCTGTCTTTCCAGGCCAGTACGGAATTCCCAGGCCCTGGTATTTTCCTTGC 2 5 2 0 

IIMIIIIIIMIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMII 

24 61 TACATTGAGGCTGTCTTTCCAGGCCAGTACGGAATTCCCAGGCCCTGGTATTTTCCTTGC 2 52 0 
2 5 21 ACCAAGTCCTACTGGTTTGGCGAGGAAAGTGATGAGAAGAGCCACCCTGGTTCCAACCAG 2 5 8 0 

MMMMMIIIIIIIIIIIIIIIIIIIIMIIIIIMIIIIMIIIIIMlllliiii 

2 521 ACCAAGTCCTACTGGTTTGGCGAGGAAAGTGATGAGAAGAGCCACCCTGGTTCCAACCAG 2 5 80 
2 5 81 AAGAGAATATCAGAAATCTGCATGGAGGAGGAACCCACCCACTTGAAGCTGGGCGTGTCC 264 0 

l l l l l j I I I I I I I ! I I I ! I I I I I I I I I I I I I I I I I I I I I M I! : I I I [ ! ! I I I I I 

1 ' ' ' ' ' ' ' 1 1 1 1 1 i if 1 1 1 i 1 1 1 1 1 1 f 1 1 1 ii ) 1 1 1 1 1 1 f 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 1 1 j i 

2 5 81 AAGAGAATATCAGAAATCTGCATGGAGGAGGAACCCACCCACTTGAAGCTGGGCGTGTCC 2 64 0 
2 641 ATTCAGAACCTGGTAAAAGTCTACCGAGATGGGATGAAGGTGGCTGTCGATGGCCTGGCA 2 70 0 
2 641 AT T C AG AAC CT GG T AAAAG T CT AC C G AG ATGG G ATG AAGG TGG CT G T CG ATG G C CTGG C A 2 70 0 
2 7 01 CTGAATTTTTATGAGGGCCAGATCACCTCCTTCCTGGGCCACAATGGAGCGGGGAAGACG 2 76 0 
2 7 01 CTGAATTTTTATGAGGGCCAGATCACCTCCTTCC 2 76 0 

2 761 ACCACCATGTCAATCCTGACCGGGTTGTTCCCCCCGACCTCGGGCACCGCCTACATCCTG 2 82 0 
2 761 ACCACCATGTCAATCCTGACCGGGTTGTTCCCCCCG 2 82 0 

2 8 21 GGAAAAGACATTCGCTCTGAGATGAGCACCATCCGGCAGAACCTGGGGGTCTGTCCCCAG 2 88 0 

MMIIIIIIIIMMMIIIIIIIIIIMIIIIKIIIIMMIIIIIIIIKIIIIII 

2 8 21 GG AAAAG AC AT T CG CT C T GAG ATG AG C A C CAT C C GG C AG AAC CTGGGGGTCTGTCCC C AG 2 8 8 0 
28 81 CATAACGTGCTGTTTGACATGCTGACTGTCGAAGAACACATCTGGTTCTATGCCCGCTTG 2 94 0 
2 8 S 1 CATAACGTGCTGTTTGACATGCTGACTGTCGAAGAACACATCTGGTTCTATGCCCGCTTG 2 94 0 
2 941 AAAGGGCTCTCTGAGAAGCACGTGAAGGCGGAGATGGAGCAGATGGCCCTGGATGTTGGT 3 0 0 0 

2 941 AAAGGGCTCTCTGAGAAGCACGTGAAGGCGGAGATGGAGCA^ 3 0 0 0 

3 0 01 TTG C C AT C AAG C AAG CT G AAAAG C AAAAC AAG C C AG CTG T C AG GTG G AAT G C AG AG AAAG 3 060 
3 0 01 T T G C CAT C AAG C AAG C TG AAAAG C AAAA C AAG C C AG CTG 1 C AGG TG G AAT GJ C AG AG AAAG 3 06 0 
3 0 61 CTATCTGTGGCCTTGGCCTTTGTCGGGGGATCTAAGGTTGTCATTCTGGATGAACCCACA 312 0 

IIIIIIIMMIIIIMIIIIIIIIIIMIlllllllliliiiiiiiiiiiiHiiiin 

3 0 61 CTATCTGTGGCCTTGGCCTTTGTCGGGGGATCTAAGGTTGTCATTCTGGATGAACCCACA 312 0 
3121 GCTGGTGTGGACCCTTACTCCCGCAGGGGAATATGGGAGCTGCTGCTGAAATACCGACAA 318 0 
3121 GCTGGTGTGGACCCTTACTCCCGCAGGGGAATATGGGAGCTGCTGCTGAAATACCGACAA 3180 
3181 GGCCGCACCATTATTCTCTCTACACACCACATGGATGAAGCGGACGTCCTGGGGGACAGG 3240 
3181 GGCCGCACCATTATTCTCTCTACACACCACATGGATGAAGCGGACGTCCTGGGGGACAGG 3 240 
3 241 ATTGCCATCATCTCCCATGGGAAGCTGTGCTGTGTGGGCTCCTCCCTGTTTCTGAAGAAC 3 3 00 



MMIMIMIIIII 

3 241 ATTGCCATCATCTCCCATGGGAAGCTGTGCTGTGTGGGCTCCTCCCTGTTTCTGAAGAAC 3 3 0 0 
3 3 01 CAGCTGGGAACAGGCTACTACCTGACCTTGGTCAAGAAAGATGTGGAATCCTCCCTCAGT 3 3 60 

IMMIIMIIIIIIIIIMIIIIIMIIIIIIIIIIIIIIMIIIM 



3 3 01 CAGCTGGGAACAGGCTACTACCTGACCTTGGTCAAGAAAGATGTGGAATCCTCCCTCAGT 3 360 
33 61 TCCTGCAGAAACAGTAGTAGCACTGTGTCATACCTGAAAAAGGAGGACAGTGTTTCTCAG 34 2 0 

IIMMIIIIMIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIIM 



Db 33 61 TCCTGCAGAAACAGTAGTAGCACTGTGTCATACCTGAAAAAGGAGGACAGTGTTTCTCAG 34 2 0 

Qy 3421 AGCAGTTCTGATGCTGGCCTGGGCAGCGACCATGAGAGTGACACGCTGACCATCGATGTC 348 0 

i I I I i i I I I I I I I I i i I I I I I I i I i I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | 
Db 3421 AGCAGTTCTGATGCTGGCCTGGGCAGCGACCATGAGAGTGACACGCTGACCATCGATGTC 34 80 

Qy 34 81 TCTGCTATCTCCAACCTCATCAGGAAGCATGTGTCTGAAGCCCGGCTGGTGGAAGACATA 3 54 0 

I I I I I I I I I I I II I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | 

Db 34 81 TCTGCTATCTCCAACCTCATCAGGAAGCATGTGTCTGAAGCCCGGCTGGTGGAAGACATA 3 54 0 

Qy 3541 GGGCATGAGCTGACCTATGTGCTGCCATATGAAGCTGCTAAGGAGGGAGCCTTTGTGGAA 3600 

Db 3 541 GGGCATGAGCTGACCTATGTGCTGCCATATGAAGCTGCTAAGGAGGGAGCCTTTGTGGAA 3 60 0 

Qy 3 6 01 CTCTTTCATGAGATTGATGACCGGCTCTCAGACCTGGGCATTTCTAGTTATGGCATCTCA 3 66 0 

MIMIIIMMIIIIIMIIMIIIIIIIIIIIIMIMIIIMMIIIIIMMIIM 

Db 3601 CTCTTTCATGAGATTGATGACCGGCTCTCAGACCTGGGCATTTCTAGTTATGGCATCTCA 3 660 

Ov 36fi1 Ci'hrmm.ArTC'vnriHL afiaAA^Ai 1 ^^'"'^ Tinnrrrtrrinr** T\r<7i r*r, ,-im^^i/-i/~im^i^.- P , m^<^, m ^,-» 

— ^ ^- s. w uiannn j. / 1 x j. x vnnuy j. *jvjv- i-\jfmarturtu lOOUOHjtjrtlijLl b Alj _5 / ^ 0 

Db 3 661 GAGACGACCCTGGAAGAAATATTCCTCAAGGTGGCCGAAGAGAGTGGGGTGGATGCTGAG 3 72 0 

Qy 3721 ACCTCAGATGGTACCTTGCCAGCAAGACGAAACAGGCGGGCCTTCGGGGACAAGCAGAGC 3780 

Db 3 721 ACCTCAGATGGTACCTTGCCAGCAAGACGAAACAGGCGGGCCTTCGGGGACAAGCAGAGC 3 78 0 

Qy 3 781 TGTCTTCGCCCGTTCACTGAAGATGATGCTGCTGATCCAAATGATTCTGACATAGACCCA 3 84 0 

Db 3 781 TGTCTTCGCCCGTTCACTGAAGATGATGCTGCTGATCCAAATGATTCTGACATAGACCCA 3 84 0 

Qy 3 841 GAATCCAGAGAGACAGACTTGCTCAGTGGGATGGATGGCAAAGGGTCCTACCAGGTGAAA 3 90 0 

Db 3841 GAATCCAGAGAGACAGACTTGCTCAGTGGGATGGATGGCAAAGGGTCCTACCAGGTGAAA 3900 

Qy 3 901 GGCTGGAAACTTACACAGCAACAGTTTGTGGCCCTTTTGTGGAAGAGACTGCTAATTGCC 3 96 0 

Db 3 901 GGCTGGAAACTTACACAGCAACAGTTTGTGGCCCTTTTGTGGAAGAGACTGCTAATTGCC 3 96 0 

Qy 3961 AGACGGAGTCGGAAAGGATTTTTTGCTCAGATTGTCTTGCCAGCTGTGTTTGTCTGCATT 4 02 0 

Db 3961 AGACGGAGTCGGAAAGGATTTTTTGCTCAGATTGTCTTGCCAGCTGTGTTTGTCTGCATT 4 02 0 

Qy 4 021 GCCCTTGTGTTCAGCGTGATCGTGCCACCCTTTGGCAAGTACCCCAGCCTGGAACTTCAG 4 08 0 

Db 4 021 GCCCTTGTGTTCAGCCTGATCGTGCCACCCTTTGGCAAGTACCCCAGCCTGGAACTTCAG 4 08 0 

Qy 4 0 81 CCCTGGATGTAC AACGAACAGT ACACATTTGTCAGCAATG ATGCTCCTGAGGACACGGGA 414 0 

Db 4081 CCCTGGATGTACAACGAACAGTACACATTTGTCAGCAATGATGCTCCTGAGGACACGGGA 4140 

Qy 4141 ACCCTGGAACTCTTAAACGCCCTCACCAAAGACCCTGGCTTCGGGACCCGCTGTATGGAA 4200 

Db 4141 ACCCTGGAACTCTTAAACGCCCTCACCAAAGACCCTGGCTTCGGGACCCGCTGTATGGAA 4200 

Qy 4 2 01 GGAAACCCAATCCCAGACACGCCCTGCCAGGCAGGGGAGGAAGAGTGGACCACTGCCCCA 4 2 60 

Db 42 01 GGAAACCCAATCCCAGACACGCCCTGCCAGGCAGGGGAGGAAGAGTGGACCACTGCCCCA 426 0 

Qy 42 61 GTTCCCCAGACCATCATGGACCTCTTCCAGAATGGGAACTGGACAATGCAGAACCCTTCA 432 0 

M :i, ii 'i ii in in ii ii n h: 

Db 4 261 GTTCCCCAGACCATCATGGACCTCTTCCAGAATGGGAACTGGACAATGCAGAACCCTTCA 4320 

Qy 43 21 CCTGCATGCCAGTGTAGCAGCGACAAAATCAAGAAGATGCTGCCTGTGTGTCCCCCAGGG 4 380 

li. I I " II M| | r ||: || Ml ,: 

Db 4321 CCTGCATGCCAGTGTAGCAGCGACAAAATCAAGAAGATGCTGCCTGTGTGTCCCCCAGGG 43 80 

Qy 43 81 GCAGGGGGGCTGCCTCCTCCACAAAGAAAACAAAACACTGCAGATATCCTTCAGGACCTG 44 4 0 

IMIMIIIIIIMIIIIIIIIMIIIIIMIIIIIIIIIIIMIIIIIIIIIIIMMI 

Db 43 81 GCAGGGGGGCTGCCTCCTCCACAAAGAAAACAAAACACTGCAGATATCCTTCAGGACCTG 444 0 

Qy 4441 ACAGGAAGAAACATTTCGGATTATCTGGTGAAGACGTATGTGCAGATCATAGCCAAAAGC 4500 



IIMm!!llll!iii!iMI!IM!lllll!llll!lll!!lll!llliilllllMI 

Db 4441 ACAGGAAGAAACATTTCGGATTATCTGGTGAAGACGTATGTGCAGATCATAGCCAAAAGC 4500 

Qy 4501 TTAAAGAACAAGATCTGGGTGAATGAGTTTAGGTATGGCGGCTTTTCCCTGGGTGTCAGT 4560 

MIMIIIimilllllllllllllllllllMllllMIIMIMMIIIIIIIIill 

Db 4501 TTAAAGAACAAGATCTGGGTGAATGAGTTTAGGTATGGCGGCTTTTCCCTGGGTGTCAGT 4560 

Qy 4 5 61 AATACTCAAGCACTTCCTCCGAGTCAAGAAGTTAATGATGCCACCAAACAAATGAAGAAA 4 62 0 

IIMMIIIIMIIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIMIIIMMIIM 

Db 45 61 AATACTCAAGCACTTCCTCCGAGTCAAGAAGTTAATGATGCCACCAAACAAATGAAGAAA 4 62 0 

Qy 4 621 CACCTAAAGCTGGCCAAGGACAGTTCTGCAGATCGATTTCTCAACAGCTTGGGAAGATTT 4 68 0 

INIMMIIIMIIIIIIIIIIIIIIIIIIIMIIMIIMMIIIMIIIIIIIMM 

Db 4 621 C AC C T AAAG CTG G C C AAG G AC AGT T C T G C AG AT C G AT T T CT C AAC AG CTTGGG AAG ATT T 4 68 0 

Qy 4 6 81 ATGACAGGACTGGACACCAGAAATAATGTCAAGGTGTGGTTCAATAACAAGGGCTGGCAT 4 74 0 

IMIMIIIIIIIIIIIIIIMIMIIIIIIIIIIIMIIIIIIIIIMIMIIIMIII 

Db 4 6 81 ATGACAGGACTGGACACCAGAAATAATGTCAAGGTGTGGTTCAATAACAAGGGCTGGCAT 4 74 0 

Qy 4741 GCAATCAGCTCTTTCCTGAATGTCATCAACAATGCCATTCTCCGGGCCAACCTGCAAAAG 4800 

Db 4 741 GCAATCAGCTCTTTCCTGAATGTCATCAACAATGCCATTCTCCGGGCCAACCTGCAAAAG 4800 

Qy 4 8 01 GGAGAGAACCCTAGCCATTATGGAATTACTGCTTTCAATCATCCCCTGAATCTCACCAAG 4860 

Db 4801 GG AG AG AAC C C T AG C C ATT ATGG AAT T AC TG CT T T C AAT C AT C C C CTG AAT CT C AC C AAG 4860 

Qy 4 861 CAGCAGCTCTCAGAGGTGGCTCCGATGACCACATCAGTGGATGTCCTTGTGTCCATCTGT 4 92 0 

Db 4 861 CAGCAGCTCTCAGAGGTGGCTCCGATGACCACATCAGTGGATGTCCTTGTGTCCATCTGT 4 92 0 

Qy 4 921 GTCATCTTTGCAATGTCCTTCGTCCCAGCCAGCTTTGTCGTATTCCTGATCCAGGAGCGG 4980 

IMIMIIIIIIIIIIIMIIMIIIMIIIIIIIIIIIIIIMIIIIIIMIIIIMII 

Db 4 921 GTCATCTTTGCAATGTCCTTCGTCCCAGCCAGCTTTGTCGTATTCCTGATCCAGGAGCGG 4 980 

Qy 4 981 GTCAGCAAAGCAAAACACCTGCAGTTCATCAGTGGAGTGAAGCCTGTCATCTACTGGCTC 5 04 0 

MMIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIMIIIIIIIIMIIIIIilll 

Db 4 9 81 GTCAGCAAAGCAAAACACCTGCAGTTCATCAGTGGAGTGAAGCCTGTCATCTACTGGCTC 5 04 0 

Qy 5041 TCTAATTTTGTCTGGGATATGTGCAATTACGTTGTCCCTGCCACACTGGTCATTATCATC 5100 

Db 5 041 TCTAATTTTGTCTGGGATATGTGCAATTACGTTGTCCCTGCCACACTGGTCATTATCATC 510 0 

Qy 5101 TTCATCTGCTTCCAGCAGAAGTCCTATGTGTCCTCCACCAATCTGCCTGTGCTAGCCCTT 516 0 

IMIIMIillllMMIIIIIIIIIIilllllll 

Db 5101 TTCATCTGCTTCCAGCAGAAGTCCTATGTGTCCTCCACCAATCTGCCTGTGCTAGCCCTT 516 0 

Qy 5161 CTACTTTTGCTGTATGGGTGGTCAATCACACCTCTCATGTACCCAGCCTCCTTTGTGTTC 522 0 

Db 5161 CTACTTTTGCTGTATGGGTGGTCAATCACACCTCTCATGTACCCAGCCTCCTTTGTGTTC 522 0 

Qy 5221 AAGATCCCCAGCACAGCCTATGTGGTGCTCACCAGCGTGAACCTCTTCATTGGCATTAAT 52 80 

Db 52 21 AAGATCCCCAGCACAGCCTATGTGGTGCTCACCAGCGTGAACCTCTTCATTGGCATTAAT 52 80 

Qy 52 81 GGCAGCGTGGCCACCTTTGTGCTGGAGCTGTTCACCGACAATAAGCTGAATAATATCAAT 5340 

Db 52 81 GGCAGCGTGGCCACCTTTGTGCTGGAGCTGTTCACCGACAATAAGCTGAATAATATCAAT 534 0 

Qy 53 41 GATATCCTGAAGTCCGTGTTCTTGATCTTCCCACATTTTTGCCTGGGACGAGGGCTCATC 54 00 

Db 53 41 GATATCCTGAAGTCCGTGTTCTTGATCTTCCCACATTTTTGCCTGGGACGAGGGCTCATC 54 00 

Qy 54 01 GACATGGTGAAAAACCAGGCAATGGCTGATGCCCTGGAAAGGTTTGGGGAGAATCGCTTT 54 60 

IIIIMIIIIilNIIIIIIIIIIMIIilllllllillllllllllllllillMMii 

Db 54 01 GACATGGTGAAAAACCAGGCAATGGCTGATGCCCTGGAAAGGTTTGGGGAGAATCGCTTT 54 6 0 

Qy 54 61 GTGTCACCATTATCTTGGGACTTGGTGGGACGAAACCTCTTCGCCATGGCCGTGGAAGGG 55 2 0 

lllllllllMMIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIilll 

Db 5461 GTGTCACCATTATCTTGGGACTTGGTGGGACGAAACCTCTTCGCCATGGCCGTGGAAGGG 55 2 0 



Qy 5521 GTGGTGTTCTTCCTCATTACTGTTCTGATCCAGTACAGATTCTTCATCAGGCCCAGACCT 5580 

Db 5 521 GTGGTGTTCTTCCTCATTACTGTTCTGATCCAGTACAGATTCTTCATCAGGCCCAGACCT 5 58 0 

Qy 5581 GTAAATGCAAAGCTATCTCCTCTGAATGATGAAGATGAAGATGTGAGGCGGGAAAGACAG 5 64 0 

IMIIillllMMIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIl 

Db 5581 G T AAATG C AAAG C T AT CT C CT C TG AATG ATG AAG ATG AAG ATG TG AGG CGGG AAAG AC AG 5640 

Qy 5 641 AG AATT CT TG ATGG TGG AG G C C AG AATG AC AT C T T AG AAATC AAGG AG T TG ACG AAG AT A 570 0 

imiiiiiiiiiiiiiMiMiiiimiiiiiiiiiiiiiii 

Db 5641 AGAATTCTTGATGGTGGAGGCCAGAATGACATCTTAGAAATCAAGGAGTTGACGAAGATA 5700 

Qy 57 01 TATAGAAGGAAGCGGAAGCCTGCTGTTGACAGGATTTGCGTGGGCATTCCTCCTGGTGAG 576 0 

Db 5 7 01 TATAGAAGGAAGCGGAAGCCTGCTGTTGACAGGATTTGCGTGGGCATTCCTCCTGGTGAG 5 76 0 

Qy 5 761 TGCTTTGGGCTCCTGGGAGTTAATGGGGCTGGAAAATCATCAACTTTCAAGATGTTAACA 5 82 0 

MIIIIIIIIIIIIIIIMIIIIMIIIIIIMIIIIMIIIIIIIIIIIIMMIMM 

->>•->■•- xwv-xxivjy^^^^i uuurvu i inn i uuuuv. 1 ^Urtrtrtrt 1 Lrt 1 LrtHt 1 1 1 LAAGA'IGtTAACA bb^O 

Qy 5 8 21 GG AG AT AC C AC TG T T AC C AG AGG AG ATG C TT T C C TT AAC AG AAAT AG T AT CTT AT CAAAC 5 88 0 

Db 5821 GG AG AT AC C AC TG TT AC C AG AGG AG ATG C T TT C CTTAACAG AAAT AG TAT C TT AT CAAAC 5 88 0 

Qy 5 8 81 ATCCATGAAGTACATCAGAACATGGGCTACTGCCCTCAGTTTGATGCCATCACAGAGCTG 5 94 0 

Db 5 8 81 ATCCATGAAGTACATCAGAACATGGGCTACTGCCCTCAGTTTGATGCCATCACAGAGCTG 5 94 0 

Qy 5 941 TTGACTGGGAGAGAACACGTGGAGTTCTTTGCCCTTTTGAGAGGAGTCCCAGAGAAAGAA 6 00 0 

IMMMIIIIIIIMIIIIIIimilllllllMIIIIMIIIIMMMimilM 

Db 5941 TTGACTGGGAGAGAACACGTGGAGTTCTTTGCCCTTTTGAGAGGAGTCCCAGAGAAAGAA 6000 

Qy 6001 GTTGGCAAGGTTGGTGAGTGGGCGATTCGGAAACTGGGCCTCGTGAAGTATGGAGAAAAA 6 06 0 

Db 60 01 GTTGGCAAGGTTGGTGAGTGGGCGATTCGGAAACTGGGCCTCGTGAAGTATGGAGAAAAA 6060 

Qy 60 61 TATGCTGGTAACTATAGTGGAGGCAACAAACGCAAGCTCTCTACAGCCATGGCTTTGATC 612 0 

Db 6061 TATGCTGGTAACTATAGTGGAGGCAACAAACGCAAGCTCTCTACAGCCATGGCTTTGATC 612 0 

Qy 6121 GGCGGGCCTCCTGTGGTGTTTCTGGATGAACCCACCACAGGCATGGATCCCAAAGCCCGG 6180 

Db 6121 GGCGGGCCTCCTGTGGTGTTTCTGGATGAACCCACCACAGGCATGGATCCCAAAGCCCGG 6180 

Qy 6181 CGGTTCTTGTGGAATTGTGCCCTAAGTGTTGTCAAGGAGGGGAGATCAGTAGTGCTTACA 6 24 0 

Db 6181 CGGTTCTTGTGGAATTGTGCCCTAAGTGTTGTCAAGGAGGGGAGATCAGTAGTGCTTACA 624 0 

Qy 6241 TCTCATAGTATGGAAGAATGTGAAGCTCTTTGCACTAGGATGGCAATCATGGTCAATGGA 6300 

Db 6241 TCTCATAGTATGGAAGAATGTGAAGCTCTTTGCACTAGGATGGCAATCATGGTCAATGGA 63 0 0 

Qy 63 01 AGGTTCAGGTGCCTTGGCAGTGTCCAGCATCTAAAAAATAGGTTTGGAGATGGTTATACA 6360 

Db 6301 AGGTTCAGGTGCCTTGGCAGTGTCCAGCATCTAAAAAATAGGTTTGGAGATGGTTATACA 6360 

Qy 63 61 ATAGTTGTACGAATAGCAGGGTCCAACCCGGACCTGAAGCCTGTCCAGGATTTCTTTGGA 64 2 0 

Db 6361 ATAGTTGTACGAATAGCAGGGTCCAACCCGGACCTGAAGCCTGTCCAGGATTTCTTTGGA 64 2 0 

Qy 6421 CTTGCATTTCCTGGAAGTGTTCCAAAAGAGAAACACCGGAACATGCTACAATACCAGCTT 6480 

IIIIIIIIIIIIIIIIIIMIIIIII 

Db 64 21 CTTGCATTTCCTGGAAGTGTTCCAAAAGAGAAACACCGGAACATGCTACAATACCAGCTT 64 80 

Qy 6481 CCATCTTCATTATCTTCTCTGGCCAGGATATTCAGCATCCTCTCCCAGAGCAAAAAGCGA 6540 

IIMMIIIIIIIIIIIMIIIIIIIIIIIIIIMIMIMIIIIIIIIIIIIIIIIIII 

Db 6481 CCATCTTCATTATCTTCTCTGGCCAGGATATTCAGCATCCTCTCCCAGAGCAAAAAGCGA 6540 



Qy 6541 CTCCACATAGAAGACTACTCTGTTTCTCAGACAACACTTGACCAAGTATTTGTGAACTTT 6 6 00 

IIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIMllllliiiiiiiiiiiiiiHi 

CTCCACATAGAAGACTACTCTGTTTCTCAGACAACACTTGACCAAGTATTTGTGAACTTT 66 00 



Db 6541 





6 601 


G C C AAGG AC C AAAG TG ATG AT G AC C AC TT AAAAG AC C T CT CATT AC AC AAAAA C C AG AC A 

IMIIIIIMIIIIIIIIIIIIMIMIIIIIIIMIIIIIMIIIIIIIIMIIIMII 

GCCAAGGACCAAAGTGATGATGACCACTTAAAAGACCTCTCATTACACAAAAACCAGACA 


6660 


JJJD 


6 6 01 


6660 


Qy 


6 661 


GTAGTGGACGTTGCAGTTCTCACATCTTTTCTACAGGATGAGAAAGTGAAAGAAAGCTAT 

lllillllllllllllllMMIIIIIIKIIMIIIIIIIIIIIIIMIIMIIIIIll 

GTAGTGGACGTTGCAGTTCTCACATCTTTTCTACAGGATGAGAAAGTGAAAGAAAGCTAT 


6720 


Tl'K 
JJD 


6 661 


6720 


Qy 


6721 


GTATGAAGAATCCTGTTCATACGGGGTGGCTGAAAGTAAAGAGGGACTAGACTTTCCTTT 

I N 1 1 1 1 1 1 M 1 1 1 ! 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 i 1 1 1 1 1 | 1 

GTATGAAGAATCCTGTTCATACGGGGTGGCTGAAAGTAAAGAGGNACTAGACTTTCCTTT 


6780 


Db 


6721 


6780 


Qy 


6781 


G C A C C ATGTG AAG TGTTG TGG AG AAAAG AG C C AG AAG TTG ATG TGG G AAG AAGT AAACTG 

IIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIII 

G C AC C AT GT G AAGTGT TGTGG AG AAAAG AGC C AG AAG T TG ATG TGGG AAG AAGT AAA C TG 


6840 


Db 


6781 


6840 


Qy 


6841 


G AT A C TG T ACTG AT ACT AT TC AAT G C AAT G C AAT T C AAT G 6880 

M 1 1 1 M 1 1 1 1 I 1 1 1 1 ! 1 1 1 1 M 1 1 ! [MINI 




Db 


6841 


1 f f 1 1 1 1 ) 1 I 1 1 I I I f )| 1 1 1 ! 1 1 1 1 1 1 1 1 M I M 1 1 1 ! 
GATACTGTACTGATACTATTCAATGCAATGCAATTCAATG 68 8 0 





SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 




68 


80 


100 


.0 


6880 


21 


AAZ94734 


Human ATP binding 


2 




68 


80 


100 


. 0 


6880 


22 


AAI70314 


Human ATP binding 


3 




68 


80 


100 


.0 


7260 


22 


AAD21326 


Human ATP binding 


4 




68. 


80 


100 


.0 


7260 


22 


AAI70315 


Human ATP binding 


5 




68' 


79 


100 


. 0 


9497 


24 


ABV7 802 0 


Hypoxia- regulated 


6 


6 


876 


. 8 


100 


. 0 


6880 


22 


AAD21325 


Human ATP binding 


7 


6. 


875 


. 8 


99 


. 9 


9741 


22 


AAS06120 


Human ABC1 


DNA seq 


8 


6375 


.8 


99 


. 9 


9741 


24 


AAD37273 


Human ABC1 


full-le 


9 


6; 


B75 


. 8 


99 


. 9 


9854 


22 


AAS06121 


Human ABC1 


DNA seq 


10 


6i 


375 


. 8 


99 


. 9 


9870 


24 


ABN99301 


Polymorphic 


human 


11 


6i 


375 


. 8 


99 


.9 


9870 


24 


ABN99302 


Polymorphic 


human 


12 


6! 


375 


. 8 


99 


. 9 


9870 


24 


ABN9 93 03 


Polymorphic 


human 


13 


6! 


375 


. 8 


99 


. 9 


9870 


24 


ABN9932 4 


Polymorphic 


human 


14 


61 


375 


. 8 


99 


.9 


9870 


24 


ABN99328 


Polymorphic human 


15 


61 


375 


.8 


99 


.9 


9870 


24 


ABN99329 


Polymorphic 


human 


16 


61 


375 


.8 


99 


. 9 


9870 


24 


ABN993 3 0 


Polymorphic 


human 


17 


6875 


. 8 


99 


.9 


9870 


24 


ABN99331 


Polymorphic human 


18 


6f 


375 


.8 


99 


.9 


9870 


24 


ABN99332 


Polymorphic 


human 


19 


6! 


375 


. 8 


99 


. 9 


9870 


24 


ABN993 3 3 


Polymorphic 


human 


20 


6f 


375 


.8 


99 


. 9 


9870 


24 


ABN99334 


Polymorphic human 


21 


6875 . 


. 8 


99 , 


.9 


9870 


24 


ABN99335 


Polymorphic 


human 


22 


6t 


375. 


.2 


99 , 


. 9 


7281 


22 


AAK51683 


Human polynucleoti 


23 


61 


374 , 


. 2 


99 , 


. 9 


9741 


24 


ABL58146 


Human ABCA1 


transp 


24 


Si 


374 , 


. 2 


99 . 


. 9 


9870 


24 


ABN99304 


Polymorphic 


human 


25 


6£ 


374 . 


. 2 


99 . 


. 9 


9870 


24 


ABN9 93 05 


Polymorphic 


human 


26 


6* 


374 . 


. 2 


99 . 


. 9 


9870 


24 


ABN99306 


Polymorphic 


human 


27 


6£ 


374 . 


.2 


99 . 


, 9 


9870 


24 


ABN99307 


Polymorphic 


human 


28 


Si 


374 . 


, 2 


99 . 


9 


9870 


24 


ABN993 0 9 


Polymorphic 


human 


29 


6£ 


374 . 


. 2 


99 . 


9 


9870 


24 


ABN99310 


Polymorphic 


human 


30 


6i 


374 . 


, 2 


99 . 


9 


9870 


24 


ABN99311 


Polymorphic 


human 


31 


6£ 


374 . 


2 


99 . 


9 


9870 


24 


ABN99312 


Polymorphic 


human 


32 


6£ 


374 . 


2 


99. 


9 


9870 


24 


ABN99313 


Polymorphic 


human 


33 


6f 


374 . 


2 


99 . 


9 


9870 


24 


ABN99314 


Polymorphic 


human 


34 


6i 


!74 . 


2 


99. 


9 


9870 


24 


ABN99315 


Polymorphic 


human 


35 


6874 . 


2 


99. 


9 


9870 


24 


ABN99316 


Polymorphic 


human 


36 


66 


174 . 


2 


99 . 


9 


9870 


24 


ABN99317 


Polymorphic 


human 


37 


6£ 


S74 . 


2 


99. 


9 


9870 


24 


ABN99318 


Polymorphic 


human 


38 


6£ 


S74 . 


2 


99. 


9 


9870 


24 


ABN99319 


Polymorphic 


human 


39 


6£ 


S74 . 


2 


99. 


9 


9870 


24 


ABN993 20 


Polymorphic 


human 


40 


68 


174 . 


2 


99 . 


9 


9870 


24 


ABN99321 


Polymorphic 


human 


41 


6£ 


174 . 


2 


99 . 


9 


9870 


24 


ABN9 93 2 2 


Polymorphic 


human 


42 


68 


!74 . 


2 


99. 


9 


9870 


24 


ABN99323 


Polymorphic 


human 


43 


6£ 


!74 . 


2 


99. 


9 


9870 


24 


ABN99325 


Polymorphic 


human 


44 


6£ 


174 . 


2 


99. 


9 


9870 


24 


ABN9 93 2 6 


Polymorphic 


human 



45 6874.2 99.9 9870 24 ABN99327 



Polymorphic human 



SUMMARIES 



Result 



% 

Query 





No. 


Score 


Match 


Length 


DB 


ID 




1 


430.8 


6 


.3 


5894 


3 


US-08-665-259-24 




2 


430. 8 


6 


.3 


5894 


3 


US-08-762-500-24 




3 


430.8 


6 


. 3 


6525 


3 


US-08-762-500-74 




4 


99 . 4 


1 


. 4 


1008 


4 


US-09-252-991A 4266 


c 


5 


98 . 8 


1 


. 4 


999 


4 


US-09-252-991A-4588 




6 


89.4 


1 


.3 


1614 


4 


US-09-2 52-991A-4198 




7 


85 . 8 


1 


. 2 


1824 


4 


US-09-252-991A-178 0 




8 


83.8 


1 


2 


1176 


4 


US-09-252-991A-3704 


c 


9 


83 . 8 


1 


2 


2253 


4 


US-09-252-991A-3 810 




10 


81 


1 


2 


3 6181 


4 


US -08^31173 1 A - 1 2 0 




11 


78 . 4 


1 


1 


1548 


4 


US-09-252-991A-3653 


c 


12 


77 . 4 


1 


1 


762 


4 


US-09-252-991A-193 8 


c 


13 


76.6 


1 


1 


4411529 


3 


US-09-103-840A-1 




14 


75 .2 


1 


1 


969 


4 


US-09-107-532A-1008 




15 


75 


1 


1 


1770 


4 


US-09-328-352-3466 



Description 



Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 

Sequence 
Sequence 
Sequence 



Ap 
Ap 



24, Appl 
24, Appl 
74, Appl 
4266, Ap 
4588, 
4198, 
1780, Ap 
3704, Ap 
3810, Ap 
12 0, App 
3653, Ap 
1938, Ap 

1, Appli 
1008 7 Ap 
3466, Ap 



SUMMARIES 

% 

Result Query 



c 



No. 


Score 


Match 


Length DB 


ID 


Description 


1 


3600 .4 


52 


.3 


4783 


11 


AK051920 


AK051920 Mus muscu 


2 


1024 . 8 


14 


. 9 


3605 
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RESULT 1 
AK051920 
LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 



AK051920 4783 bp mRNA linear HTC 05-DEC-2002 

Mus musculus 12 days embryo eyeball cDNA, RIKEN full-length 
enriched library, clone : D23 0019D04 product : ATP-binding cassette, 
sub-family A (ABC1) , member 1, full insert sequence. 
AK0 5192 0 

AK051920.1 GI:26342297 

HTC; CAP trapper. 

Mus musculus (house mouse) 
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Takeda,Y., Tanaka,T., Tomaru,A., Toya,T., Yasunishi , A . , 
Muramatsu, M. and Hayashizaki , Y . 
Direct Submission 

Submitted { 16 -JUL- 2001 ) Yoshihide Hayashizaki, The Institute of 
Physical and Chemical Research (RIKEN) , Laboratory for Genome 
Exploration Research Group, RIKEN Genomic Sciences Center (GSC) , 
RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 23 0-0045, Japan (E-mail ; genome- re s@gsc . riken . go . jp , 
URL: http: //genome .gsc . riken. go . jp/ , Tel : 81-45 -503 - 9222 , 
Fax: 81- 45-503-9216) 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
URL : http : / /genome . gsc . riken . go . jp/ 
URL : http : / / f antom . gsc . riken . go . jp/ . 

— uvatiwu/ yuQj.j.L±CJ.i5 

1- .4783 

/organism="Mus musculus" 

/mol_t ype = " mRNA " 

/strain= ,, C5 7BL/6J" 

/ db_xref = "FANTOM_DB : D2 30 019D04 11 

/db_xref="taxon: 10090" 

/clone="D2 3 0 019D04 " 

/tissue_type= "eyeball" 

/clone_lib= "RIKEN full-length enriched mouse cDNA library" 
/dev_stage="12 days embryo" 
322. .>4782 

/not e=" unnamed protein product; ATP-binding cassette, 

sub- family A (ABC1) , member 1 (MGD | MGI : 99607 , 

GB |NM_0134 54 , evidence: BLASTN, 99%, match=4545) 

putative" 

/codon_start=l 

/protein_id="BAC34811 . 1" 

/db_xref ="GI :26342298" 

/ translat ion= "MACWPQLRLLLWKNLTFRRRQTCQLLLEVAWPLFIFLILISVRL 
SYPPYEQHECHFPNKAMPSAGTLPWVQGIICNANNPCFRYPTPGEAPGWGNFNKSIV 
SRLFSDAQRLLLYSQRDTSIKDMHKVLRMLRQIKHPNSNLKLQDFLVDNETFSGFLQH 
NLSLPRSTVDSLLQANVGLQKVFLQGYQLHLASLCNGSKLEEIIQLGDAEVSALCGLP 
RKKLDAAERVLRYNMDILKPWTKLNSTSHLPTQHLAEATTVLLDSLGGLAQELFSTK 
SWSDMRQEVMFLTNVNSSSSSTQIYQAVSRIVCGHPEGGGLKIKSLNWYEDNNYKALF 
GGNNTEEDVDTFYDNSTTPYCNDLMKNLESSPLSRIIWKALKPLLVGKILYTPDTPAT 
RQVMAEVNKTFQELAVFHDLEGMWEELSPQIWTFMENSQEMDLVRTLLDSRGNDQFWE 
QKLDGLDWTAQDIMAFLAKNPEDAQSPNGSVYTWREAFNETNQAIQTISRFMECVNLN 
KLEPIPTEVRLINKSMELLDERKFWAGIVFTGITPDSVELPHHVKYKIRMDIDNVERT 
NKIKDGYWDPGPRADPFEDMRYVWGGFAYLQDWEQAIIRVLTGSEKKTGVYVQQMPY 
PCYVDDIFLRVMSRSMPLFMTLAWIYSVAVIIKSIVYEKEARLKETMRIMGLDNGILW 
FSWFVSSLIPLLVSAGLLWILKLGNLLPYSDPSWFVFLSVFAMVTILQCFLISTLF 
SRANLAAACGGIIYFTLYLPYVLCVAWQDYVGFSIKIFASLLSPVAFGFGCEYFALFE 
EQGIGVQWDNLFESPVEEDGFNLTTAVSMMLFDTFLYGVMTWYIEAVFPGQYGIPRPW 
YFPCTKSYWFGEEIDEKSHPGSSQKGVSEICMEEEPTHLRLGVSIQNLVKVYRDGMKV 
AVDGLALNFYEGQITSFLGHNGAGKTTTMSILTGLFPPTSGTAYILGKDIRSEMSSIR 
QNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEKHVKAEMEQMALDVGLPPSKLKSKT 
SQLSGGMQRKLSVALAF VGGSKWILDEPTAGVDPYSRRGIWELLLKYRQGRTIILST 
HHMDEADILGDRIAIISHGKLCCVGSSLFLKNQLGTGYYLTLVKKDVESSLSSCRNSS 
STVSCLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNLIRKHVSEARLVEDIGHEL 
TYVLPYEAAKEGAFVELFHE IDDRLSDLGI SS YGI SETTLEEI FLKVAEESGVDAETS 
DGTLPARRNRRAFGDKQSCLHPFTEDDAVDPNDSDIDPESRETDLLSGMDGKGSYQLK 
GWKLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSLIVPPFGKYPSLE 
LQPWMYNEQYTFVSNDAPEDMGTQELLNALTKDPGFGTRCMEGNPIPDTPCLAGEEDW 
TISPVPQSIVDLFQNGNWTMKNPSPACQCSSDKIKKMLPVCPPGAGGLPPP" 
1119 a 1236 c 1310 g 1118 t 



Query Match 52.3%; Score 3600.4; 

Best Local Similarity 8 8.6%; Pred . No. 0; 
Matches 3901; Conservative 0; Mismatches 



DB 11; Length 4 78 3; 
5 01; Indels 0; 



Gaps 



QY 1 CAAACATGTCAGCTGTTACTGGAAGTGGCCTGGCCTCTATTTATCTTCCTGATCCTGATC 60 



mi in,!. 1 , 1:111,11,; 

3 8 2 CAAACATGTCAGCTGTTACTGGAAGTGGCCTGGCCTCTCTTTATCTTCCTGATCCTGATC 4 41 
61 TCTGTTCGGCTGAGCTACCCACCCTATGAACAACATGAATGCCATTTTCCAAATAAAGCC 12 0 

HIM II I Mlllilllll IIIIIIMIH Mill Mill II 

442 TCTGTACGCCTGAGCTACCCACCCTACGAACAACATGAGTGCCACTTTCCGAATAAAGCC 5 01 
121 ATGCCCTCTGCAGGAACACTTCCTTGGGTTCAGGGGATTATCTGTAATGCCAACAACCCC 18 0 

I' IN Mlllilllll II II Mm || || 1 1| || Ml III II II I 

5 02 ATGCCGTCTGCAGGAACCCTCCCCTGGGTACAGGGGATTATCTGTAATGCCAACAACCCC 5 61 
181 TGTTTCCGTTACCCGACTCCTGGGGAGGCTCCCGGAGTTGTTGGAAACTTTAACAAATCC 24 0 

II IIINIII M HIM II MMIIIMM IIMMMMMIMIMM Ml 

5 62 TGCTTCCGTTATCCAACTCCCGGCGAGGCTCCCGGTGTTGTTGGAAACTTTAACAAGTCC 621 
241 ATTGTGGCTCGCCTGTTCTCAGATGCTCGGAGGCTTCTTTTATACAGCCAGAAAGACACC 3 0 0 

II I IHMMI MM MIMIIIM I MINIM I III III 

622 ATCGTGTCTCGCCTGTTCTCAGACGCTCAGAGGCTTCTTCTGTACAGCCAAAGAGATACC 6 81 
301 AGCATGAAGGACATGCGCAAAGTTCTGAGAACATTACAGCAGATCAAGAAATCCAGCTCA 36 0 

Hill MMIIIMI Ml II M II I II MM IIIMMIM I III MM 

6 82 AG C ATT AAGG AC ATG C A C AAGGT C CTG AG AATG T T ACGG C AG AT C AAG C ATC C C AACT C A 741 



420 



361 AACTTGAAGCTTCAAGATTTCCTGGTGGACAATGAAACCTTCTCTGGGTTCCTGTATCAC 

II HIIIIII M Mill IMMMMIMIMM I II I M M MINI I Ml 

742 AATTTGAAGCTCCAGGATTTTCTGGTGGACAATGAAACATTCTCTGGATTCCTGCAGCAC 801 
'421 AACCTCTCTCTCCCAAAGTCTACTGTGGACAAG ATGCTGAGGGCTGATGTCATTCTCCAC 480 

II I H II IMI IMIMIMMM Mill III MM IIMM 

AATTTGTCCCTTCCAAGATCTACTGTGGACAGCCTGCTGCAGGCGAATGTTGGTCTCCAG 861 



802 



481 AAGG T ATTTTT G C AAGG CT A CC AG TT AC ATTTG AC AAGT C TGTG C AAT GG AT C AAAAT C A 540 

IIIIIIIMIMIIIIMMMI MIMIIM I MMIMI M MIMIIIM I 

862 AAGGTATTTTTGCAAGGCTACCAATTACATTTGGCCAGTCTGTGTAACGGATCAAAATTA 921 
541 GAAGAGATGATTC7AACTTGGTGACCAAGAAGTTTCTGAGCTTTGTGGCCTACCAAGGGAG 600 

HMI II Mill MMIMI IIIMMIM II Mill Mill III II 

92 2 GAAGAAATTATTCAGCTTGGTGATGCGGAAGTTTCTGCCCTCTGTGGTCTACCGAGGAAG 981 
6 01 AAACTGGCTGCAGCAGAGCGAGTACTTCGTTCCAACATGGACATCCTGAAGCCAATCCTG 660 

II II I HUH HI HUM! II I IMIIIIIIMIIIMIIIIII I M 

982 AAG C T CG ATG C AG C CG AG AG AGT A CTG C G C T AC AAC AT GG AC AT C CTG AAG C C AG T TGTG 1041 
661 AGAACACTAAACTCTACATCTCCCTTCCCGAGCAAGGAGCTGGCCG7AAGCCACAAAAACA 72 0 

in 1 1.1 nun 11 mini mini i ii i miii imiimi i i 

104 2 ACAAAACTAAATTCCACATCTCATCTCCCGACCCAGCATCTGGCTGAAGCCACCACAGTG 1101 
721 TTGCTGCATAGTCTTGGGACTCTGGCCCAGGAGCTGTTCAGCATGAGAAGCTGGAGTGAC 78 0 

1 1 1 1 1 I II I Ml MMIMI MIMMMMII I IMIIMI Ml 

TTGCTTGACAGCTTGGGGGGCCTGGCCCAAGAGCTGTTCAGCACAAAGAGCTGGAGCGAC 1161 



1102 



7 81 ATGCGACAGGAGGTGATGTTTCTGACCAATGTGAACAGCTCCAGCTCCTCCACCCAAATC 84 0 

UNI IMIIIMMIIIIIIIMIIII IIIIIMIIMIMIIIimiMII III 

ATGCGGCAGGAGGTGATGTTTCTGACCAACGTGAACAGCTCCAGCTCCTCCACCCAGATC 



1162 



1221 



841 TACCAGGCTGTGTCTCGTATTGTCTGCGGGCATCCCGAGGGAGGGGGGCTGAAGATCAAG 900 

1 1 1 1 1 J f I Mill II II II II II M M IMM Mill IMMMIMM 

122 2 TACCAGGCAGTGTCCCGCATCGTGTGTGGTCACCCAGAGGGTGGGGGCCTGAAGATCAAG 12 81 
901 TCTCTCAACTGGTATGAGGACAACAACTACAAAGCCCTCTTTGGAGGCAATGGCACTGAG 960 

II iniMIMM MIII IIIMIIIIMmillMIIIIIII Ml III III 

1282 TCCCTCAACTGGTACGAGGATAACAACTACAAAGCCCTCTTTGGAGGGAATAACACCGAG 1341 
961 GAAGATGCTGAAACCTTCTATGACAACTCTACAACTCCTTACTGCAATGATTTGATGAAG 102 0 

IIIH I II nillMMIIMI MIMIIIIMIM II I M M II I II I M 1 1 1 

13 4 2 GAAGACGTGGACACCTTCTATGACAATTCTACAACTCCTTATTGCAATGATTTGATGAAG 14 01 
10 21 AATTTGGAGTCTAGTCCTCTTTCCCGCATTATCTGGAAAGCTCTGAAGCCGCTGCTCGTT 10 8 0 

_ 11 iiiiiiiiiiiiiiiiiiii 11 11111 inn n n inn inn in 

14 02 AACTTGGAGTCTAGTCCTCTTTCTCGAATTATTTGGAAGGCACTCAAGCCACTGCTTGTT 1461 



Qy 10 81 GGGAAGATCCTGTATACACCTGACACTCCAGCCACAAGGCAGGTCATGGCTGAGGTGAAC 114 0 

II Mill II IIIIIMMIIIM lllll 

Db 1462 GGAAAGATTCTCTATACACCTGACACACCAGCTACAAGGCAGGTCATGGCTGAGGTGAAC 1521 

Qy 1141 AAGACCTTCCAGGAACTGGCTGTGTTCCATGATCTGGAAGGCATGTGGGAGGAACTCAGC 1200 

Illlllll lllll lllllllllillllll lllll IIMIIIIIII lllllllll 

Db 15 2 2 AAGACCTTTCAGGAGTTGGCTGTGTTCCATGACCTGGAGGGCATGTGGGAAGAACTCAGC 1581 

Qy 12 01 CCCAAGATCTGGACCTTCATGGAGAACAGCCAAGAAATGGACCTTGTCCGGATGCTGTTG 126 0 

Ml I II IIIMIIIIMIIIIIIIIIIIIIII lllllllllillllll I I I I | | 
Db 15 82 CCCCAAATTTGGACCTTCATGGAGAACAGCCAAGAGATGGACCTTGTCCGGACGCTGTTA 1641 

Qy 12 61 GACAGCAGGGACAATGACCACTTTTGGGAACAGCAGTTGGATGGCTTAGATTGGACAGCC 1320 

■in i iiiiiiiii ii iii Minimi 'i mi i mi 

Db 164 2 GACAGCAGAGGCAATGACCAGTTTTGGGAACAGAAGTTGGATGGATTAGATTGGACTGCC 1701 

Qy 1321 CAAGACATCGTGGCGTTTTTGGCCAAGCACCCAGAGGATGTCCAGTCCAGTAATGGTTCT 138 0 

lllllllll Illlllll Illlllll IIIIMI Mil IMIII lllll III 

Db 17 02 CAAGACATCATGGCGTTTCTGGCC AAGAACCCAGAAGATGCTCAGTCCCCAAATGGCTCT 1761 

Qy 13 81 GTGTACACCTGGAGAGAAGCTTTCAACGAGACTAACCAGGCAATCCGGACCATATCTCGC 144 0 

Mill IMIMMMMMMIMI lllll IMIIIIIIIIM III M 1 1 II II 

Db 176 2 GTGTATACCTGGAGAGAAGCTTTCAATGAGACCAACCAGGCAATCCAGACGATATCTCGA 1821 

Qy 14 41 TTCATGGAGTGTGTCAACCTGAACAAGCTAGAACCCATAGCAACAGAAGTCTGGCTCATC 1500 

MMMIMMMMIMIIMMIMM IMIMM I IIIIIMM Illlllll 

Db 18 22 T T C ATGG AG TGTGT C AAC CTG AA C AAG C T GG AAC C C AT T C CG AC AG AAGT C AGG C T CAT C 1881 

Qy 15 01 AACAAGTCCATGGAGCTGCTGGATGAGAGGAAGTTCTGGGCTGGTATTGTGTTCACTGGA 1560 

MIIIIIIIIIIMIIIIMIM MIMMIIIIMIIIIIM II IMIIIII M 

Db 18 82 AACAAGTCCATGGAGCTGCTGGACGAGAGGAAGTTCTGGGCTGGCATCGTGTTCACAGGC 1941 

Qy 1561 AT T ACTC C AGG C AG C ATTG AG C TG C C C CAT C ATG T C AAGT AC AAG AT C C G AATGG A CAT T 1620 

M IIIIMI II I MM IIIMIMM III MM Ml I Mill I lllllllll 

Db 1942 ATCACTCCAGATAGTGTGGAGCTGCCCCATCATGTAAAGTACAAGATCCGGATGGACATT 2001 

Qy 1621 GACAATGTGGAGAGGACAAATAAAATCAAGGATGGGTACTGGGACCCTGGTCCTCGAGCT 16 8 0 

HIM Illlllll II Mill I, M M I,! Ml I Ml 

Db 2 0 02 GACAACGTGGAGAGAACTAATAAGATCAAGGATGGGTACTGGGACCCTGGTCCTCGGGCT 2 061 

Qy 1681 GACCCCTTTGAGGACATGCGGTACGTCTGGGGGGGCTTCGCCTACTTGCAGGATGTGGTG 1740 

Mill lllll M Mill II IMIMM IMIII Mill III IMIMM MM I 

Db 2 06 2 GACCCTTTTGAAGATATGCGCTATGTCTGGGGCGGCTTCGCCTACTTGCAGGATGTGGTG 2121 

Qy 1741 GAGCAGGCAATCATCAGGGTGCTGACGGGCACCGAGAAGAAAACTGGTGTCTATATGCAA 1800 

M Mill IMIMM IMIIIIIMI I II II M M M I IMIMM lllll 

Db 2122 GAACAGGCCATCATCAGAGTGCTGACGGGATCTGAGAAGAAAACGGGTGTCTACGTGCAA 2181 

Qy 18 01 CAGATGCCCTATCCCTGTTACGTTGATGACATCTTTCTGCGGGTGATGAGCCGGTCAATG 1860 

I M I II II I II M I II I M IMMIIMM II M I M M 1 1 MIMMIMIMM 

Db 218 2 CAGATGCCCTACCCCTGTTATGTTGATGACATTTTTCTGCGGGTCATGAGCCGGTCAATG 2241 

Qy 1861 CCCCTCTTCATGACGCTGGCCTGGATTTACTCAGTGGCTGTGATCATCAAGGGCATCGTG 192 0 

I M I II I M II I M II IMIIIII Mill II IMIIIIIIIMIII MM III 

Db 224 2 CCCCTCTTCATGACTCTAGCCTGGATCTACTCTGTCGCTGTGATCATCAAGAGCATTGTG 2 3 01 

Qy 1921 T ATG AG AAGG AGG C A CGG C T G AAAG AG A C C ATG CGG AT C AT GGG C CT GG AC AA C AG CAT C 19 80 

I M I M II I M I II IMIIIII MMMMMMMMIMI IMIMM lllll 

Db 23 02 TATGAGAAGGAGGCTCGGCTGAAGGAGACCATGCGGATCATGGGTCTGGACAATGGCATC 2 361 

Qy 1981 CTCTGGTTTAGCTGGTTCATTAGTAGCCTCATTCCTCTTCTTGTGAGCGCTGGCCTGCTA 2 04 0 

IIMIIIIIIIIMMI MM Illlllll Mill IIIIIIIIMIIIMMIII 

Db 23 62 CTCTGGTTTAGCTGGTTTGTTAGCAGCCTCATCCCTCTGCTTGTGAGCGCTGGCCTGCTG 2 4 21 

Qy 2 041 GTGGTCATCCTGAAGTTAGGAAACCTGCTGCCCTACAGTGATCCCAGCGTGGTGTTTGTC 210 0 

IIMIMM MMMMMIMIMMMIMM IMM MMIMMIMM III 

Db 24 2 2 GTGGTCATCTTGAAGTTAGGAAACCTGCTGCCCTATAGTGACCCCAGCGTGGTGTTCGTC 24 81 

Qy 2101 TTCCTGTCCGTGTTTGCTGTGGTGACAATCCTGCAGTGCTTCCTGATTAGCACACTCTTC 2160 

MMMM MMMM II Mill MIMMIMI IIMMII IMIII 

Db 24 8 2 TTCCTGTCTGTGTTTGCCATGGTGACCATCCTACAGTGCTTCCTCATTAGCACGCTCTTC 2 541 



2220 



2161 TCCAGAGCCAACCTGGCAGCAGCCTGTGGGGGCATCATCTACTTCACGCTGTACCTGCCC 

HI I IIIINIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIMIIK II 

2 54 2 TCCCGTGCCAACCTGGCAGCAGCCTGTGGGGGCATC ATCTACTTCACGCTGTACCTGCCC 26 01 
2 2 21 TACGTCCTGTGTGTGGCATGGC AGGACTACGTGGGCTTCACACTCAAGATCTTCGCTAGC 22 8 0 

H M NMI N N lllllllllll IIIIIMM I llllllllll llllll 

2 6 02 TATGTGCTGTGCGTAGCCTGGCAGGACTATGTGGGCTTCTCCATCAAGATCTTTGCTAGC 2661 
2 2 81 CTGCTGTCTCCTGTGGCTTTTGGGTTTGGCTGTGAGTACTTTGCCCTTTTTGAGGAGCAG 234 0 

IIIIMIIIIIIIIIIIII lllllllllll II Mllllll IMIIIII 

2 66 2 CTGCTGTCTCCTGTGGCTTTTGGATTCGGCTGTGAGTATTTCGCCCTTTTCGAGGAGCAA 2 721 
2341 GGCATTGGAGTGCAGTGGGACAACCTGTTTGAGAGTCCTGTGGAGGAAGATGGCTTCAAT 24 0 0 

M M M II 1 1 Mllllll II Mllllll II Mllllll II IIIIIMM 

2 722 GGTATCGGGGTCCAATGGGACAATCTCTTTGAGAGCCCGGTGGAGGAGGACGGCTTCAAT 2 781 

24 01 CTCACCACTTCGGTCTCCATGATGCTGTTTGACACCTTCCTCTATGGGGTGATGACCTGG 24 6 0 
IIIIIMM I II II M M I I u ii i i i i i i ii ii i t i 

I 1 1 1 1 1 ' f ' ' ' i iiiiiiillll l l I l | || M I I I I M I I M IMIIIII Ml 

2 782 CTCACCACTGCAGTGTCCATGATGCTCTTTGACACCTTTCTCTATGGCGTGATGACATGG 2 841 
24 61 TACATTGAGGCTGTCTTTCCAGGCCAGTACGGAATTCCCAGGCCCTGGTATTTTCCTTGC 2 52 0 

HIM H M MIMMIIM MM MMIMMIMIMMMIIMM 

2 842 TACATCGAAGCCGTCTTTCCAGGACAGTATGGAATTCCCAGGCCCTGGTATTTTCCTTGT 2 901 
2 521 ACCAAGTCCTACTGGTTTGGCGAGGAAAGTGATGAGAAGAGCCACCCTGGTTCCAACCAG 2580 

IMIIIII lllllllllll IMIMI 1 1 1 1 1 II I II I M II 1 1 II I II I II I MM 

2 9 02 ACCAAGTCATACTGGTTTGGTGAGGAAATTGATGAGAAGAGCCACCCTGGTTCCAGCCAG 2 961 
2 5 81 AAGAGAATATCAGAAATCTGCATGGAGGAGGAACCCACCCACTTGAAGCTGGGCGTGTCC 264 0 

III 1 1 I lllllllllllllllll IIMMIMM M III llllll llllll 

2 962 AAGGGAGTGTCAGAAATCTGCATGGAAGAGGAACCCACTCATCTGAGGCTGGGGGTGTCC 3 021 
2641 ATTCAGAACCTGGTAAAAGTCTACCGAGATGGGATGAAGGTGGCTGTCGATGGCCTGGCA 2 70 0 

UN Ml Mill I I II II Mill MIMI Mllllll HIM MMM MM 

3 02 2 ATTCAGAACCTGGTGAAGGTTTACCGAGATGGCATGAAGGTTGCTGTGGATGGCTTGGCG 3 081 

27 01 CTGAATTTTTATGAGGGCCAGATCACCTCCTTCCTGGGCCACAATGGAGCGGGGAAGACG 2 760 

I I II Mill II IMIIIII I II II I I I II II I M I II II M M II Mllllll 

3 0 82 CTCAACTTTTACGAAGGCCAGATTACCTCCTTCCTGGGCCACAATGGAGCAGGGAAGACC 3141 

2 761 ACCACCATGTCAATCCTGACCGGGTTGTTCCCCCCGACCTCGGGCACCGCCTACATCCTG 2 82 0 

llllllllllllll Mill Ml MM HIM M II IIIM MIMMIMM 

314 2 ACCACCATGTCAATACTGACTGGGCTGTTTCCCCCAACTTCTGGCACGGCCTACATCCTG 3201 
2 821 GGAAAAGACATTCGCTCTGAGATGAGCACCATCCGGCAGAACCTGGGGGTCTGTCCCCAG 2 8 8 0 

M II IMMMIMI MIIIIMI MIMMMMMMMM Mllllllllll 

32 02 GGGAAGGACATTCGCTCGGAGATGAGCTCCATCCGGCAGAACCTGGGAGTCTGTCCCCAG 3 261 
2 8 81 CATAACGTGCTGTTTGACATGCTGACTGTCGAAGAACACATCTGGTTCTATGCCCGCTTG 2 94 0 

Mill MMIMMMMIMMIIMMIMM lllllllllllllllll III I 

32 62 CATAATGTGCTGTTTGACATGCTGACTGTCGAAGAGCACATCTGGTTCTATGCGCGCCTA 3 321 

2 941 AAAGGGCTCTCTGAGAAGCACGTGAAGGCGGAGATGGAGCAGATGGCCCTGGATGTTGGT 3 0 00 

M MIMMI MMMMMIMI II IIMMMMIMMMIMMMIMM 

3 3 2 2 AAGGGGCTCTCAGAGAAGCACGTGAAAGCAGAGATGGAGCAGATGGCCCTGGATGTTGGC 3381 
3 0 01 TTGCCATCAAGCAAGCTGAAAAGCAAAACAAGCCAGCTGTCAGGTGGAATGCAGAGAAAG 3 060 

M M I I M 1 1 II II I 1 1 II II II II II Mill Mllllll MMIIIIMM 

3 3 82 TTACCCCCGAGCAAGCTGAAAAGCAAAACGAGTCAGCTCTCAGGTGGGATGCAGAGAAAG 3441 
3 0 61 CTATCTGTGGCCTTGGCCTTTGTCGGGGGATCTAAGGTTGTCATTCTGGATGAACCCACA 3120 

M lllllllllllllllll II II HIM MMIMIMMIIMI II I 

3 44 2 CTGTCTGTGGCCTTGGCCTTCGTGGGTGGATCCAAGGTTGTCATTCTGGACGAGCCCACA 3 5 01 
3121 GCTGGTGTGGACCCTTACTCCCGCAGGGGAATATGGGAGCTGCTGCTGAAATACCGACAA 318 0 

M 1 1 Mllllll Mill IMMIMMMMMI II Mill MIMMI III 

3 5 02 GCCGGGGTGGACCCGTACTCTCGCAGGGGAATATGGGAACTCCTGCTAAAATACCGGCAA 3 561 
3181 GGCCGCACCATTATTCTCTCTACACACCACATGGATGAAGCGGACGTCCTGGGGGACAGG 3 2 4 0 

MMMMMIMM I IMIIIIIIIIIIIIM Mill Ml MM Mllllll 



3 562 GGCCGCACCATTATTTTGTCTACACAGCACATGGACGAAGCTGACATCCTTGGGGACAGA 3621 
32 41 ATTGCCATCATCTCCCATGGGAAGCTGTGCTGTGTGGGCTCCTCCCTGTTTCTGAAGAAC 3 3 0 0 

""""III II IIIIIMIII IIIIIIIMIIIIIIIIIII 



3 6 22 ATTGCCATCATTTCCCATGGGAAGCTGTGTTGTGTGGGCTCCTCCCTGTTTTTGAAAAAC 3 6 81 

33 01 CAGCTGGGAACAGGCTACTACCTGACCTTGGTCAAGAAAGATGTGGAATCCTCCCTCAGT 3 3 6 0 

III I I I I I I I II I I I I I I I I I I I I | | | I I I | | | | | | | | | I | | M II I I I I I I I 
3 682 CAGTTGGGAACGGGTTACTATCTGACCCTGGTTAAGAAAGATGTGGAATCGTCCCTCAGT 3 741 

3361 TCCTGCAGAAACAGTAGTAGCACTGTGTCATACCTGAAAAAGGAGGACAGTGTTTCTCAG 34 2 0 



IJIjlMII Mill Hill I lllllllllllilllllllllllll 



3 74 2 TCCTGCAGAAACAGTAGCAGCACCGTGTCTTGTCTGAAAAAGGAGGACAGTGTTTCTCAG 3 8 01 
34 21 AGCAGTTCTGATGOT 3 4 8 0 

3802 



iiiiiiiiimmiiiiiiiiiiiiiiiii iiiiiiiiiiiiiiimimiiii 

AGCAGTTCTGATGCTGGCCTGGGCAGCGACCATGAAAGTGACACGCTGACCATCGATGTC 3 861 



3481 

" " 1 X ' 1 1 ' ' 1 1 1 1 ' ' 1 1 1 ' ' 1 1 ' f ii i minium ii ii i iii ii 1 1 



3862 TCTGCTATCTCCAACCTCATCAGGAAGCACGTGTCTGAAGCCCGGCTGGTGGAGGACATT 3921 
3 541 GGGCATGAGCTGACCTATGTGCTGCCATATGAAGCTGCTAAGGAGGGAGCCTTTGTGGAA 3 60 0 



1 1 [ M 1 1 1 II Mill M llllMi 



3 922 GGGCACGAGCTGACCTATGTGCTGCCGTACGAAGCCGCGAAGGAGGGAGCCTTTGTGGAA 3 981 
3 601 CTCTTTCATGAGATTGATGACCGGCTCTCAGACCTGGGCATTTCTAGTTATGGCATCTCA 3 66 0 



3 98 2 CTCTTCCATGAGATTGATGACCGGCTCTCAGACCTGGGCATCTCCAGTTATGGCATCTCG 4 041 



3 661 GAGACGACCCTGGAAGAAATATTCCTCAAGGTGGCCGAAGAGAGTGGGGTGGATGCTGAG 
Hill ' Mill IIIMIII IIIIIIMIIMIII 

GAGACCACCCTGGAAGAAATATTCCTCAAAGTGGCTGAAGAGAGCGGGGTGGATGCTGAG 



4042 



3 721 ACCTCAGATGGTACCTTGCC AGC AAGACGAAACAGGCGGGCCTTCGGGGAC AAGC AGAGC 
millllllllll limillllllllllllll IIIIIIIIIIIIMMIIIIIIII 
ACCTCAGATGGTACTTTGCCAGCAAGACGAAACAGACGGGCCTTCGGGGACAAGCAGAGC 



4102 



3720 
4101 
3780 
4161 



3 781 TGTCTTCGCCCGTT 3 84 0 

4162 TGTCTGCACCCATTTACGGAAGATGATGCTGTTGATCCCAATGACTCT 4 2 2 1 

3841 GAATCCAGAGAGACAGACTTGCTCAGTGGGATGGATGGCAAAGGGTCCTACCAGGTGAAA 3 90 0 

mum mil mi mmiiiimiimii imiimi iimimm mm 

42 2 2 GAATCCAGGGAGACCGACCTGCTCAGTGGGATGGACGGCAAAGGCTCCTACCAGCTGAAG 42 81 
3901 GGCTGGAAACTTACACAGCAACAGTTTGTGGCCCTTTTGTGGAAGAGACTGCTAATTGCC 3 960 



42 82 GGCTGGAAACTCACCCAGCAACAGTTTGTGGCCCTTTTGTGGAAGAGGCTGCTGATTGCC 4 341 



3 961 AGACGGAGTCGGAAAGGATTTTTTGCTCAGATTGTCTTGCCAGCTGTGTTTGTCTGCATT 

I f / 1 1 1 1 1 Mill II II MMMMMIIIM IMIIMIII Mill III 

4 34 2 AGACGGAGCCGGAAGGGTTTCTTTGCTCAGATTGTCCTGCCAGCTGTCTTTGTTTGCATT 



4402 



4020 
4401 
4080 
4461 
4140 

44 62 CCCTGGATGTATAATGAGCAGTATACATTTGTCACT 4521 
4141 ACCCTGGAACTCTTAAACGCCCTCACCAAAGACCCTGGCTTCGGGACCCGCTGTATGGAA 4 2 0 0 



4 0 21 GCCCTTGTGTTCAGCCTGATCGTGCCACCCTTTGGCAAGTACCCCAGCCTGGAACTTCAG 

lim II IIMIIIMM IIMIMMIMMMMMMMMMIMMIM,,, 

GCCCTGGTCTTCAGCCTGATTGTGCCACCCTTTGGCAAGTACCCCAGCCTGGAACTTCAG 



4 0 81 CCCTGGATGTACAACGAACAGTAC 



ii i ii ii ii miiiii ii inn Miiiiiiiiii 



4 52 2 ACCCAGGAACTCCTGAATGCTCTGACCAAAGATCCAGGCTTTGGGACCGGCTGTATGGAA 4 5 81 
4 2 01 GGAAACCCAATCCCAGACACGCCCTGCCAGGCAGGGGAGGAAGAGTGGACCACTGCCCCA 4 2 60 

IMIIMMIII II M Ml Ml IMIIMI II MIIIII III 

4 5 8 2 GGAAACCCAATCCCAGATACCCCTTGCTTGGCTGGGGAGGAGGACTGGACCATCAGCCCC 4 641 



4 2 61 GTTCCCCAGACCATCATGGACCTCTTCCAGAATGGGAACTGGACAATGCAGAACCCTTCA 



4320 



Db 



Qy 



Db 



Qy 



Db 



II I ) 1 1 1 1 1 INI lllllllllllllllllll llllllll Ml llllin ill 

4 64 2 GTCCCCCAGAGCATCGTGGACCTCTTCCAGAATGGAAACTGGACCATGAAGAACCCCTCA 4 701 
4 3 21 CCTGCATGCCAGTGTAGCAGCGACAAAATCAAGAAGATGCTGCCTGTGTGTCCrCCAGGG 4 3 8 0 

HIM IIIIIIIIIIIIM IIIIIIMIMIIIIIIIIIIIMIIMIIIIlMllll 

4 70 2 CCTGCGTGCCAGTGTAGCAGTGACAAAATCAAGAAGATGCTGCCTGTGTGTCCCCCAGGG 4 761 

4381 GCAGGGGGGCTGCCTCCTCCAC 4402 

MIIIIMIIIIII Mill I 
4 76 2 GCAGGGGGGCTGCCACCTCCTC 4 7 83 



SEQ ID NO: 2 



Result 



Query 



No . 


Score 


Match 


Length 


DB 


ID 


Description 


1 


11469 


100 


. 0 


22 01 


21 


AAY79380 


Human ATP binding 


2 


11469 


100 


. 0 


2201 


22 


AAE13021 


Human ATP binding 


3 


11469 


100 


. 0 


2201 


22 


AAM50227 


Human ATP binding 


4 


11469 


100 


. 0 


2201 


23 


ABP65164 


Hypoxia- regulated 


5 


11469 


100 


. 0 


2261 


22 


AAE13022 


Human ATP binding 


6 


11469 


100 


. 0 


2261 


22 


AAM50228 


Human ATP binding 


7 


11469 


100 


. 0 


2261 


22 


AAU02176 


Human ABC1 . Homo 


8 


11469 


100 


. 0 


2261 


22 


AAU02177 


Human ABC1 mutant 


9 


11469 


100 


. 0 


2261 


23 


ABB83111 


Human ABCA1 transp 


10 


11469 


100 


. 0 


2261 


23 


ABP52092 


Homo sapiens ABC t 


11 


11469 


100 


. 0 


2261 


23 


AAE23000 


Human ABC1 full-le 


12 


11468 


100 


. 0 


2261 


23 


ABB83117 


Polymorphic human 


13 


11468 


100 


.0 


2261 


23 


ABB83124 


Polymorphic human 


14 


11466 


100 


. 0 


2261 


22 


AAU02183 


Human ABC1 mutant 


15 


11466 


100 


. 0 


2261 


22 


AAU02188 


Human ABC1 mutant 


16 


11466 


100 


. 0 


2261 


23 


ABB83115 


Polymorphic human 


17 


11466 


100 


0 


2261 


23 


ABB83116 


Polymorphic human 


18 


11466 


100 


0 


2261 


23 


ABB83119 


Polymorphic human 


19 


114 66 


100 


0 


2261 


23 


AfciBo3 122 


Polymorphic human 


20 


11466 


100 


0 


2261 


23 


ABB83123 


Polymorphic human 


21 


11465 


100 


0 


2261 


22 


AAU02189 


Human ABC1 mutant 


22 


11464 


100 


0 


2261 


23 


ABB83121 


Polymorphic human 


23 


11463 


99 


9 


2261 


22 


AAU02181 


Human ABC1 mutant 


24 


11462 


99. 


9 


2261 


22 


AAM78550 


Human protein SEQ 


25 


11462 


99. 


9 


2263 


22 


ABB11956 


Human ABCA1 homolo 


26 


11462 


99. 


9 


2263 


22 


AAM79534 


Human protein SEQ 


27 


11461 


99. 


9 


2261 


22 


AAU02182 


Human ABC1 mutant 


28 


11461 


99 . 


9 


2261 


22 


AAU02186 


Human ABC1 mutant 


29 


11459 


99. 


9 


2261 


23 


ABB83118 


Polymorphic human 


30 


11458 


99 . 


9 


2261 


23 


ABB83120 


Polymorphic human 


31 


11440 


99. 


7 


2261 


21 


AAB3 8 0 82 


Human ABC1 cholest 


32 


11440 


99 . 


7 


2261 


22 


AAB71749 


Human ABC1 protein 


33 


11440 


99 . 


7 


2261 


24 


ABU11899 


Human ATP-binding 


34 


11439 


99. 


7 


2261 


23 


ABB81578 


Human ABC -A- 1-1 pr 


35 


11437 


99 . 


7 


2261 


21 


AAB38109 


Human ABC1 cholest 


36 


11437 


99. 


7 


2261 


21 


AAB38111 


Human ABC1 cholest 


37 


11437 


99 . 


7 


2261 


21 


AAB38114 


Human ABC1 cholest 


38 


11437 


99. 


7 


2261 


21 


AAB38115 


Human ABC1 cholest 


39 


11437 


99 . 


7 


2261 


21 


AAB38117 


Human ABC1 cholest 


No. 


Score 


Match 


Length DB 


ID 


Description 



1 


6909 


60 


2 


1375 


3 


US- 


08 


-665 


-259- 


26 


2 


6909 


60 


2 


1375 


3 


US- 


08 


-762 


-500- 


26 


3 


3129 . 5 


27 


3 


1457 


3 


US- 


08 


-665 


-259- 


27 


4 


3129.5 


27 


3 


1457 


3 


US- 


08 


-762 


-500- 


27 


5 


2635.5 


23 


0 


1684 


3 


us- 


08 


-665 


-259- 


25 


6 


2635 . 5 


23 


0 


1684 


3 


US- 


08 


-762 


-500- 


25 


7 


2635 . 5 


23 


0 


1704 


3 


US- 


08 


-762 


-500- 


75 



Sequence 26, Appl 

Sequence 26, Appl 

Sequence 27, Appl 

Sequence 27, Appl 

Sequence 25, Appl 

Sequence 25, Appl 

Sequence 75, Appl 



8 


359 


3 


. 1 


315 


4 


US 


09-328 


-352-4388 


9 


354 


3 


1 


332 


4 


us- 


-09-107 


-532A-3752 


10 


352 . 5 


3 


1 


335 


4 


us- 


-09-252 


- 991A-20837 


11 


352 


3 


1 


322 


4 


us- 


09-107 


-532A-4662 


12 


346 


3 


0 


589 


4 


us- 


09-328 


-352-7592 


13 


336 . 5 


2 


9 


1280 


2 


us- 


08-583 


-276-19 


14 


335 . 5 


2 


9 


291 


4 


us- 


09-107 


-532A-4205 


15 


335.5 


2 


9 


1280 


4 


us- 


09-767 


-594-2 


16 


335 . 5 


2 


9 


1280 


6 


5206352-4 




17 


333 


2 


9 


1279 


2 


us- 


08-784 


-649A-2 


18 


332 


2 . 


9 


402 


4 


us- 


09-107 


-532A-5360 


19 


330 . 5 


2 . 


9 


1280 


2 


us- 


08-752 


-447-2 



Sequence 43 88, Ap 
Sequence 3752, Ap 
Sequence 20837, A 
Sequence 4662, Ap 
Sequence 75 92, Ap 
Sequence 19, Appl 
Sequence 4 205, Ap 
Sequence 2, Appli 
Patent No. 5206352 
Sequence 2, Appli 
Sequence 5360 , Ap 
Sequence 2, Appli 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



RESULT 1 

US-08-665-259-26 

Sequence 26, Application US/08665259 
Patent No. 6 028173 
GENERAL INFORMATION: 

APPLICANT: Landes, Gregory M. 

Burn, Timothy C. 
Connors, Timothy D. 
Dackowski, William R. 
Van Raay, Terence J. 
Klinger, Katherine W. 
TITLE OF INVENTION : NOVEL HUMAN CHROMOSOME 16 GENES, 

TITLE OF INVENTION: COMPOSITIONS, METHODS OF MAKINg'aND USING SAME 
NUMBER OF SEQUENCES : 73 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: GENZYME CORPORATION 
STREET: One Mountain Road 
CITY: Framingham 
STATE: Massachusetts 
COUNTRY: United States of America 
ZIP: 01701 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS -DOS 
SOFTWARE: Patent In Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/665,259 
FILING DATE : 17-JUN-1996 
CLASSIFICATION: 435 
ATTORNEY/ AGENT INFORMATION: 
NAME: Dugan, Deborah A. 
REGISTRATION NUMBER: 37,315 
REFERENCE/DOCKET NUMBER: IG5-9.1 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (508) 872-8400 
TELEFAX: (508) 872-5415 
INFORMATION FOR SEQ ID NO: 26: 
SEQUENCE CHARACTERISTICS: 
LENGTH : 13 7 5 amino acids 
TYPE: amino acid 
STRANDEDNESS : not relevant 
TOPOLOGY: unknown 
MOLECULE TYPE: protein 
US-08-665-259-26 

Query Match 60.2%; Score 6909; DB 3; Length 1375; 

Best Local Similarity 96.9%; Pred. No . 0 ; 

Matches 1332; Conservative 21; Mismatches 22; Indels 0; Gaps 0; 
Qy 827 CMEEEPTHLKLGVSIQNLVKVYRDGMKVAVDGLALNFYEGQITSFLGHNGAGKTTTMSIL 886 

Db 1 iiiiiiiihiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiMiiiiiimiiiiii 

1 CMEEEPTHLRLGVSIQNLVKVYRDGMKVAVDGLALNFYEGQITSFLGHNGAGKTTTMSIL 60 

946 



QY 887 TGLFPPTSGTAYILGKDIRSEMSTIRQNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEK 

I lllliMMIIIIhllllillllllliliiiiiiiiiiiiiiHiiiiii 



Db 61 TGLFPPTSGTAYILGKDIRSEMSSIRQNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEK 120 

Qy 947 HVKAEMEQMALDVGLPSSKLKSKTSQLSGGMQRKLSVALAFVGGSKWILDEPTAGVDPY 1006 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | 
Db 121 HVKAEMEQMALDVGLPPSKLKSKTSQLSGGMQRKLSVALAFVGGSKWILDEPTAGVDPY 180 

Qy 1007 SRRGIWELLLKYRQGRTI ILSTHHMDEADVLGDRIAI ISHGKLCCVGSSLFLKNQLGTGY 1066 

, I I I M I I I I I I I I I I I I | | | | | | | | | | | | : | | | | | | | | | | | | | | | | | | | j | | | | | | | | | | 

Db 181 SRRGIWELLLKYRQGRTI ILSTHHMDEADILGDRIAI ISHGKLCCVGSSLFLKNQLGTGY 240 

Qy 1067 YLTLVKKDVESSLSSCRNSSSTVSYLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNL 1126 

IMIIMIIIIIIMIIIMim ' 

Db 24 1 YLTLVKKDVESSLSSCRNSSSTVSCLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNL 300 

Qy 112 7 IRKHVSEARLVEDIGHELTYVLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETTLEE 118 6 

Db 301 IRKHVSEARLVEDIGHELTYVLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETTLEE 360 

OV 1187 IFTiKVARRSCTnaRTRnnTT.DAPDMDDflirnriU'nCPT TD pUTirnn?! 1\ nnnnnnmrmnnnmr, ^ .. ^ 
~-i ■ niT.iu.ni.rUit u^rv^kjv,jji\i: r i iLiJunniJri^JJOLIXUt'CjOA.ril JJ 

IIMIIIIIMIIIMIIIIIMMIIMIIIMII lllllll IIIMIIIIMIIII 

Db 361 IFLKVAEESGVDAETSDGTLPARRNRRAFGDKQSCLHPFTEDDAVDPNDSDIDPESRETD 420 

Qy 1247 LLSGMDGKGSYQVKGWKLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSL 1306 

Db 421 LLSGMDGKGSYQLKGWKLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSL 4 80 

07 !3 07 IVPPFGKYPSLELQPWMYNEQYTFVSNDAPEDTGTLELLNALTKDPGFGTRCMEGNPIPD 1366 

IIIIIIIMMIIIIIIIIIIIMIIIIIIII I! IIIIIMIMIIIIIIMMMII 

Db 4 81 IVPPFGKYPSLELQPWMYNEQYTFVSNDAPEDMGTQELLNALTKDPGFGTRCMEGNPIPD 54 0 

Qy 1367 TPCQAGEEEWTTAPVPQTIMDLFQNGNWTMQNPSPACQCSSDKIKKMLPVCPPGAGGLPP 1426 

HI Mll = ll UII|:|:||||||M||:|||||||||||!MIIIII||||MMII 
Db 541 TPCLAGEEDWTISPVPQSIVDLFQNGNWTMKNPSPACQCSSDKIKKMLPVCPPGAGGLPP 600 

Qy 1427 PQRKQNTADILQDLTGRNISDYLVKTYVQIIAKSLKNKIWVNEFRYGGFSLGVSNTQALP 1486 

1 1 1 1 1 1 1 1 1 1 M I M ! I I I I I I I I I I M | | | | I | I | | | | | | | | | | | | | | | I | | : | | | I 
Db 601 PQRKQKTADILQNLTGRNISDYLVKTYVQIIAKSLKNKIWVNEFRYGGFSLGVSNSQALP 660 

Qy 1487 PSQEVNDATKQMKKHLKLAKDSSADRFLNSLGRFMTGLDTRNNVKVWFNNKGWHAISSFL 1^46 

II IMII I II 1 1 III i 1 1 II --11111111 III II II Nil 

DO 661 PSHEVNDAI KQMKKLLKLTKDTSADRFLSSLGRFMAGLDTKNNVKVWFNNKGWHAI SSFL 72 0 

Qy 154 7 WINNAI LRANLQKGENPSHYGITAFNHPLNLTKQQLSEVAPMTTS VDVLVS I CVI FAMS 1606 

IIIIMIMIIIMIIIII IIIIMIIIIIIIIIIIIMI 

Db 721 NVINNAILRANLQKGENPSQYGITAFNHPLNLTKQQLSEVALMTTSVDVLVSICVIFAMS 780 

Qy 1607 FVPASFWFLIQERVSKAKHLQFISGVKPVIYWLSNFVWDMCNYWPATLVIIIFICFQQ 1666 

NIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIllllllllliiiiiM 

Db 781 FVPASFWFLIQERVSKAKHLQFISGVKPVIYWLSNFVWDMCNYWPATLVIIIFICFQQ 840 

Qy 16 67 KSYVSSTNLPVLALLLLLYGWSITPLMYPASFVFKIPSTAYVVLTSVNLFIGINGSVATF 172 6 

MIIIIIIIMMIIIIIIIIIIIIIIIIIIMIIMIMIMIIIIIIIIIIIIIIIM 

Db 841 KSYVSSTNLPVLALLLLLYGWSITPLMYPASFVFKIPSTAYWLTSVNLFIGINGSVATF 900 

Qy 1727 VLELFTDNKLNNINDILKSVFLIFPHFCLGRGLIDMVKNQAMADALERFGENRFVSPLSW 1786 

Db 901 VLELFTNNKLNDINDILKSVFLIFPHFCLGRGLIDMVKNQAMADALERFGENRFVSPLSW 960 

Qy 17 8 7 DLVGRNLFAMAVEGWFFLITVLIQYRFFIRPRPVNAKLSPLNDEDEDVRRERQRILDGG 184 6 

iiiMiiMimiiiiMiiimiiiiiiim in iiiiiiiiiiniiiiiiii 

Db 961 DLVGRNLFAMAVEGWFFLITVLIQYRFFIRPRPVKAKLPPLNDEDEDVRRERQRILDGG 1020 

Qy 1847 GQNDILEIKELTKIYRRKRKPAVDRICVGIPPGECFGLLGVNGAGKSSTFKMLTGDTTVT 1906 

IIIIIIIMIIIIIIIII|||||||||:|||||||||||M||||||:||||||||| M 
Db 1021 GQNDILEIKELTKIYRRKRKPAVDRICIGIPPGECFGLLGVNGAGKSTTFKMLTGDTPVT 1080 

Qy 1907 RGDAFLNRNSILSNIHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPEKEVGKVGE 1966 

IIIIIMNMIMIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIMMIIIIIIII M 

1081 RGDAFLNKNSILSNIHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPEKEVGKFGE 1140 
Qy 196 7 WAIRKLGLVKYGEKYAGNYSGGNKRKLSTAMALIGGPPWFLDEPTTGMDPKARRFLWNC 2 02 6 



Db ii4i 
Qy 2027 



MMMMMilllll llllllllllMllllllllllliiiiiiiiiiiiiiinin 

WAIRKLGLVKYGEKYASNYSGGNKRKLSTAMALIGGPPWFLDEPTTGMDPKARRFLWNC 



1200 



ALSWKEGRSWLTSHSMEECEALCTRMAIMVNGRFRCLGS VQHLKNRFGDGYTI WRIA 2 0 8 6 

Ml = 11 III 1 1 llll II 1 1 II I Ml II II III INN Mill || I MM Hi |M 

Db 1201 ALSIVKEGRSWLTSHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNRFGDGYTIWRIA 1260 

Qy 2087 GSNPDLKPVQDFFGLAFPGSVPKEKHRNMLQYQLPSSLSSLARIFSILSQSKKRLHIEDY 2146 

IMIIIIIIhll Ill 

Db 1261 GSNPDLKPVQEFFGLAFPGSVLKEK^iRNMLQYQLPSSLSSLARIFSILSQSKKRLHIEDY 1320 

Qy 214 7 SVSQTTLDQVFVNFAKDQSDDDHLKDLSLHKNQTWDVAVLTSFLQDEKVKESYV 2201 

1 1 M I I I I M I I I I I I I II I I II I I I I II I I I I I | | | || II I I I I I I I I I I | | | | 
Db 1321 SVSQTTLDQVFVNFAKDQSDDDHLKDLSLHKNQTWDVAVLTSFLQDEKVKESYV 1375 



RESULT 2 

US-08 -762-500-26 

Sequence 26, Application US/08762500 
Patent No. 6030806 
GENERAL INFORMATION: 

APPLICANT : Landes, Gregory M. 
APPLICANT: Burn, Timothy C. 
APPLICANT: Connors, Timothy D. 
APPLICANT: Dackowski, William R. 
APPLICANT: Van Raay, Terence J. 
APPLICANT: Klinger, Katherine W. 

TITLE OF INVENTION: NOVEL HUMAN CHROMOSOME 16 GENES , 

TITLE OF INVENTION: COMPOSITIONS, METHODS OF MAKING ' AND USING SAME 
NUMBER OF SEQUENCES: 8 3 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: GENZYME CORPORATION 
STREET: One Mountain Road 
CITY : Framingham 
STATE: Massachusetts 
COUNTRY: United States of America 
ZIP: 01701 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC- DOS /MS -DOS 
SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/762 , 500 
FILING DATE: 09-DEC-1996 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/665,259 
FILING DATE: 17-JUN-1996 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/US96/104 6 9 
FILING DATE: 17-JUN-1996 
ATTORNEY/AGENT INFORMATION: 
NAME: Dugan, Deborah A. 
REGISTRATION NUMBER: 37,315 
REFERENCE /DOCKET NUMBER: IG5-9.3 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (508) 872-8400 
TELEFAX: (508) 872-5415 
INFORMATION FOR SEQ ID NO : 26: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 13 7 5 amino acids 
TYPE: amino acid 
STRANDEDNESS : not relevant 
TOPOLOGY: unknown 
MOLECULE TYPE: protein 
US-08-762-500-26 



Query Match 



60.2%; Score 6909; DB 3; Length 1375; 



Best Local Similarity 96.9%; Pred . No. 0; 

Matches 1332; Conservative 21; Mismatches 22; Indels 0; Gaps 0; 

827 CMEEEPTHLKLGVSIQNLVKVYRDGMKVAVDGLALNFYEGQITSFLGHNGAGKTTTMSIL 886 

I I I i I I M I : I I I I I I I I I I | | | | | | | ] | | | | | | | | | | | | | | | | | | | |||,| | | 

Db 1 GMEEEPTHLRLGVSIQNLVKVYRDGMKVAVDGLALNFYEGQITSFLGHNGAGKTTTMSIL 60 

^ 887 TGLFPPTSGTAYILGKDIRSEMSTIRQNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEK 94 6 

c I I I I I I I I I I I I I I I I I I I I I | | : | | | | | | | | | | | | [ | | | | | | | | | | | | | | || | 

Db 61 TGLFPPTSGTAYILGKDIRSEMSSIRQNLGVCPQHWLFDMLTVEEHIWFYARLKGLSEK 120 

Q y 947 HVKAEMEQMALDVGLPSSKLKSKTSQLSGGMQRKLSVALAFVGGSKWILDEPTAGVDPY 1006 

I I I i I I I I I I I I I I I I | | | | | | | | M | | | | | | | | | | | | | | | | | | | | | | | | | | | | 

Db 121 HVKAEMEQMALDVGLPPSKLKSKTSQLSGGMQRKLSVALAFVGGSKWILDEPTAGVDPY 



180 



Qy 1007 SRRGIWELLLKYRQGRTIILSTHHMDEADVLGDRIAIISHGKLCCVGSSLFLKNOLGTGY 1066 

l0 nil iiiiMiiiiiiiiiiiiihiiiimiiiiii in nun i ii 

Db 181 SRRGIWELLLKYRQGRTIILSTHHMDEADILGDRIAIISHGKLCCVGSSLFLKNQLGTGY 240 

^ 1067 YLjTLVK KDVESSLSSCRNSSSTVSYLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNL 1126 

0 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I | M I I I I I I I I I | | ! M I I 
Db 241 YLTLVKKDVESSLSSCRNSSSTVSCLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNL 300 

Qy 112 7 IRKHVSEARLVEDIGHELTYVLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETTLEE 1186 

I N I I M I I I I I I | | | | | | | | | | | | | | | | | | I I I I I | | | | | | I I I I I I I I I I I I I I I I I I 

Db 301 IRKHVSEARLVEDIGHELTYVLPYEAAKEGAFVELFHEIDDRLSDLiGISSYGISETTLEE 360 

Qy 1187 IFLKVAEESGVDAETSDGTLPARRNRRAFGDKQSCLRPFTEDDAADPNDSDIDPESRETD 1246 

_ HI 1 1 II I III || 1 1| Ml III III III I II II 1 1 III I II I II I II I II Ml 

Db 361 IFLKVAEE SGVDAETSDGTLPARRNRRAFGDKQSCLHPFTEDDAVDPNDSDIDPESRETD 42 0 

QY 1247 LLSGMDGKGSYQVKGWKLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIAItVFSL 1306 

I I M 1 1 1 1 I II I : II 1 1 M II I M I II I M II I I I | | | | ! || I I I I I II I I II II I II I I 

Db 421 LLSGMDGKGSYQLKGWKLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSL 48 0 

Qy 1307 IVPPFGKYPSLELQPWMYNEQYTFVSNDAPEDTGTLELLNALTKDPGFGTRCMEGNPIPD 1366 

1 1 1 1 1 1 i 1 1 ! I I I I I II I I I I | | | | | | | | | | I || | || II II I II I I II I I | || | | | | | 
Db 481 IVPPFGKYPSLELQPWMYNEQYTFVSNDAPEDMGTQELLNALTKDPGFGTRCMEGNPIPD 540 

Qy 1367 TPCQAGEEEWTTAPVPQTIMDLFQNGNWTMQNPSPACQCSSDKIKKMLPVCPPGAGGLPP 1426 

IN IliMI =III|:|:||||||||||:||||||||IIIIIMIIMMIIIIIIII 
Db 541 TPCLiAGEEDWTISPVPOSITOLFONGMWTMKMP.QDannraQnvTTrTrMT nunnnr^PT ™ 6Q0 



' M I I ■ I I • I I I I * i ■ I II I I I M | I : | | | I | 1 | | | | | | | | | | | [ [ | | j j | | | | | [ 

TPCLAGEEDWTISPVPQSIVDLFQNGNWTMKNPSPACQCSSDKIKKMLPVCPPGAGGLPP 

^QRKQNTADILQDLTGRNISDYLVKTYVQIIAKSLKNKIWVNEFRYGGFSLGVSNTQALE 

Db 601 n 1 1 1 him j i : j iiiiiiijj iiijj.il | j in i ii i ii I I I I I : I I I I 



! ' 1 I I If I ■ I I I I I I It I I I i I I I I I I I i f I I I | | | | II f | | | | | | | | | | | : | | | | 

PQRKQKTADILQNLTGRNISDYLVKTYVQIIAKSLKNKIWVNEFRYGGFSLGVSNSQALP 660 

Qy 14 37 PSQE VNDATKQMKKHLKLAKDS SADRFLNS LGRFMTGLDTRNNVKVWFNNKGWHAI S S FL 154 6 

_ 'I III II II 11 = 11 II I hi III 1 1 lllhllMIIIIIIIIIMMII 

661 D^HUWnS TK-nvyiTT T mu-nmf^nnr,T r,nr 



11 1 i i i i ii : 1 I I I I I : I I I I I I ! i M : I I I I I I M I I f M I | 

P S HE VND A I KQ M K KL L KLT KD T S ADR F L S S LG R FMAG LD T KNN VK VW FNNKG WH A I 



SSFL 720 



Db 

Qy 

Db 

Qy 1667 



Qy 1547 NVINNAILRANLQKGENPSHYGITAFNHPLNLTKQQLSEVAPMTTSVDVLVSICVIFAMS 1606 

IIIIIIIIIIIHIIIIII IIIIMIIIMMMIIIIII 

721 NVINNAILRANLQKGENPSQYGITAFNHPLNLTKQQLSEVALMTTSVDVLVSICVIFAMS 780 
Qy 1607 FVPASFWFLIQERVSKAKHLQFISGVKPVIYWLSNFVWDMCNYWPATLVIIIFICFQQ 1666 

f 1 1 1 M 1 1 1 1 f 1 1 [ f| II 1 1 1 1 1 1 1 M 1 1 1 f [ 1 1 1 1 1 1 [ M 1 1 [[ 1 1 1 M 1 1 1 1 1 1 1 1 1 1 

81 FVPASFWFLIQERVSKAKHLQFISGVKPVIYWLSNFVWDMCNYWPATLVIIIFICFQQ 840 

KSYVSSTNLPVLALLLLLYGWSITPLMYPASFVFKIPSTAYWLTSVNLFIGINGSVATF 1726 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I II I II I II I I I I M | || I I I I 
Db 841 KSYVSST NLPVLALLLLLYGWSITPLMYPASFVFKIPSTAYWLTSVNLFIGINGSVATF 900 

Qy 1727 VLELFTDNKLNNINDILKSVFLIFPHFCLGRGLIDMVKNQAMADALERFGENRFVSPLSW 1786 

J Jl "I ; l II llllllllllllllllllll III I II I III II Ml MM II MMII 1 1 

Db 901 VLELFTNNKLNDINDILKSVFLIFPHFCLGRGLIDMVKNQAMADALERFGENRFVSPLSW 960 

Qy 1787 DLVGRNLFAMAVEGWFFLITVLIQYRFFIRPRPWAKLSPLNDEDEDVRRERQRILDGG 1846 

Q „ I 1 1 1 1 1 1 1 1 1 1 1 1 M II I I II I II I I I || I I II I I Ml I I II I I I I I M I | | | || | | | 
Db 961 DLVGRNLFAMAVEGWFFLITVLIQYRFFIRPRPVKAKLPPLNDEDEDVRRERQRILDGG 1020 



Qy 


1847 


GQNDILEIKELTKIYRRKRKPAVDRICVGIPPGECFGLLGVNGAGKSSTFKMLTGDTTVT 
GQNDILEIKELTKIYRRKRKPAVDRICIGIPPGECFGLLGWGAGKSTTFKMLTGDTPVT 


1906 


Db 


1021 


1080 


Qy 


1907 


RGDAFLNRNSILSNIHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPEKEVGKVGE 

IIIMIhllMIIIIIIIIIIIMIMMIIIIIIIIlllliiiMiiii 1 II 

RGDAFLNKNSILSNIHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPEKEVGKFGE 


1966 


Db 


1081 


1140 


Qy 


1967 


WAIRKLGLVKYGEKYAGNYSGGNKRKLSTAMALIGGPPWFLDEPTTGMDPKARRFLWNC 

ii ii ii urn II III llll II MM II II III mi mi 

WAIRKLGLVKYGEKYASNYSGGNKRKLSTAMALIGGPPWFLDEPTTGMDPKARRFLraC 


2026 


Db 


1141 


1200 


Qy 


2027 


ALSWKEGRSWLTSHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNRFGDGYTIWRIA 

iii : iiiiiiiiiiiiiinniiiiiiiniinmimiiiinmiiiiiiiii 

ALSIVKEGRSWLTSHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNRFGDGYTIWRIA 


2086 


Db 


1201 


1260 


Qy 


2087 


GSNPDLKPVQDFFGLAFPGSVPKEKHRNMLQYQLPSSLSSLARIFSILSQSKKRLHIEDY 

iiiiiiinnmniini niiiiiiiiiiiiiiniiiiimiiiiiiiiiiii 

GSNPDLKPVOFFF^T.AFP^.qVT.VFVUDMMT.ny™ noor nr. J iiiJJJJ .'.ILL 1 . 


2146 


Db 


1261 


1320 


Qy 


2147 


S VSQTTLDQVFVNFAKDQSDDDHLKDLSLHKNQT WDVAVLTSFLQDEKVKES YV 2 201 

iiiiiiiniiiniiiiiiniiiiiiiiiiiiniiiiiiiiiiniiiini 

SVSQTTLDQVFVNFAKDQSDDDHLKDLSLHKNQTWDVAVLTSFLQDEKVKESYV 13 75 




Db 


1321 





SUMMARIES 



Result 




Query 








No. 


Score 


Match 


Length DB 


ID 


1 


10906 


95 


.1 


2201 


2 


A54774 


2 


3338 . 5 


29 


. 1 


1529 


2 


A59189 


3 


3129.5 


27 


.3 


1472 


2 


B54774 


4 


2638 . 5 


23 


. 0 


1704 


2 


S71363 


5 


2635 . 5 


23 


. 0 


1704 


2 


A59188 


6 


2061 


18 


. 0 


1802 


2 


T33783 


7 


1975 


17 


2 


1816 


2 


A84845 


8 


1854 . 5 


16 


2 


1447 


2 


T15200 


9 


1792 


15 


6 


1317 


2 


C88925 


10 


1535 .5 


13 


4 


1758 


2 


F88559 


11 


1528.5 


13 


3 


1704 


2 


T42749 


12 


1526 


13 


3 


1767 


2 


S60124 


13 


1393 


12 


1 


1246 


2 


T00826 


14 


1150.5 


10 


0 


1564 


2 


T27121 


15 


1006 .5 


8 . 


8 


373 


2 


T47150 


16 


1005.5 


8 . 


8 


1431 


2 


T22748 


17 


846 


7 . 


4 


269 


2 


T46467 


18 


786.5 


6 . 


9 


1011 


2 


T07712 


19 


695 


6 . 


1 


900 


2 


T07717 



Description 



ATP binding casset 
ATP-binding casset 
ATP binding casset 
probable ATP-bindi 
ATP-binding casset 
hypothetical prote 
probable ABC trans 
hypothetical prote 
protein F33E11.4 [ 
protein C48B4 .4b [ 
ATP-binding casset 
transport protein 
hypothetical prote 
hypothetical prote 
hypothetical prote 
hypothetical prote 
hypothetical prote 
probable ABC- type 
probable ABC -type 



RESULT 1 
A54774 

ATP binding cassette transporter ABC1 - mouse 
C; Species: Mus musculus (house mouse) 

C;Date: 05-Apr-1995 #sequence__revision 05-Apr-1995 #text change 02-Feb-2001 
C;Accession: A54774 ~ 
R;Luciani, M.F.; Denizot, f . ; Savary, S.; Mattel, M.G.; Chimini, G. 
Genomics 21, 150-159, 1994 

A; Title: Cloning of two novel ABC transporters mapping on human chromosome 9 

A;Reference number: A54774; MUID : 943 75008 ; PMID: 8088782 

A; Accession : A54 7 74 

A; Molecule type: mRNA 

A /Residues: 1-2201 <LUC> 

A/Cross-references: GB:X75926; NID:g495256; PIDN : CAA53 53 0 . 1 ; PID:g495257 
C;Superfamily : unassigned ATP-binding cassette proteins; ATP-binding cassette homology 
C; Keywords: ATP; duplication; nucleotide binding; P-loop 
F; 856-1047/Domain: ATP-binding cassette homology ' <ABC1> 
F; 873 -880/Region : nucleot ide-binding motif A (P-loop) 



Db 



F;1869 2060/Domain: ATP-binding cassette homology <ABC2> 
F;1886-l893/Region: nucleot ide-binding motif A (P-loop) 

Query Match 95.1%; Score 10906; DB 2; Length 2201; 

Best Local Similarity 94.8%; Pred . No. 0; 

Matches 2087; Conservative 54 ; Mismatches 60; Indels 0; Gaps o ; 
Qy 1 MPSAGTLPWVQGIIC^ 6Q 

Db 1 mpsagt lpwqgiicna^pcfryptpgeapgwgnLksivsrl^ 6 o 

Qy 61 smkdmr kvlrtlqqikksssnlklqdflvdnetfsgflyhnlslpkstvdkmlradvilh 120 

I : l I I INI hill : I I I I I I I I I M I I I I I I M I I I I I h I I I I : I : : I | 
UO 61 SIKDMHKVLR MLRQIKHPNSNLKLQDFLVDNETFSGFLQHNLSLPRSTVDSLLQXNVGLQ 120 

Qy 121 KVFLQGYQLHLTSLCNGSKSEEMIQLGDQEVSELCGLPREKLAAAERVLRSNMDILKPIL 180 

10 iMiniiiii iiiiiii ihiiin iii iinihii mini MMIIh: 

121 KVFLQGYQLHLASLCNGS KLEE 1 1 QLGDAEVS ALCGLPRKKLDAAERVLRYNMD I LKP W 180 
QY 181 RTLNSTSPFp SKELAEATKTLLHSLGTLAQELFSMRSWSDMRQEVMFLTNVNSSSSSTOI 240 

11 11 I I- II III I Ml | IIIIMI hill III || II inn mi nil 

Db 181 TKLNST SHLPTQHLAEATTVLLDSLGGLAQELFSTKSWSDMRQEVMFLTNVNSSSSSTQI 240 

Q Y 241 yQAVSRIVCGHPEGGGLKIKSLNWYEDNNYKALFGGNGTEEDAETFYDNSTTPYCNDLMK 300 

300 



I I I I I I I I I I I I I I I I I I I I I I I I I I | | I I I I I I I I I I I I I h II I I I I I I I | | | | | | 
Db 241 Y Q AV SRIVCGHPEGGGLKIKSLNWYEDNNYKALFGGNNTEEDVDTFYDNSTTPYCNDLMK 



Q y 301 NLESSPLSRIIWKALKPLLVGKILYTPDTPATRQVKAEVNKTFQELAVFHDLEGMWEELS 360 



I I I I I I I I I I I , , , . ,,,,,,,, 

Db 301 NLESSPL SRIIWKALKPLLVGKILYTPDTPATRQVMAEVNKTFQELAVFHDLEGMWEELS 360 

QY 361 PK ™TFMENSQEMDLVF^ 420 

420 



MIMIMMMMM INN II MlhllllllllllhllllhlllMI In 

Db 3 61 PQIWTFMENSQEMDLVRTLLDSRGNDQFWEQKLDGLDWTAQDIMAFLAKNPEDVQSPNGS 



Qy 421 VYTWREAFNETNQAIRTISRFMECVNLNKLEPIATEVWLINKSMELLDERKFWAGI VFTG 4 80 

I I I I I I I I I I I I I I h I I I I I I I I I I I I I I I I I III I I I I | | | | | | | | | | h I I I II I 
Db 421 VYTWREAFNETNQAIQTISRFMECVNLNKLEPIPTEVRLINKSMELLDERKFWAGIVFTG 480 



Q y 481 jTPGSIELPHHVKYKIRMDIDNVERTNKIKDGYWDPGPRADPFEDMRYVWGGFAYLQDVV 540 

Db 481 ITPD SVELPHHVKYKIRMDIDNVERTNKIKDG^ 540 

Qy 541 EQAIIRVLTGTEKKTGVYMQQMPYPCYVDDIFLRVMSRSMPLFMTLAWIYSVAVIIKGIV 600 

iiiiiiiiihiiiiiihiiiiiiiiiiiiiiiiihiniiiniiiiiniin n 

Db 541 EQAIIRVLTGSEKKTGVYVQQMPYPCYVDDIFLRVMSRSMPLFMTLAWIYSVAVIIKSIV 600 

Qy 601 YEKEARLKETMRIMGLDNSILWFSWFISSLIPLLVSAGLLWILKLGNLLPYSDPSWFV 660 

hiiiiiiiniiiihi iiihihin i,iiMii 



"Mill I f 1 M I I ! I I : M M | f || || | || | | J | | | | | | 

Db 601 YEKEARLKETMRIMGLDNGILWFSWFVSSLIPLLVSAGLLWILKLGNLLPYSDPSWFV 660 



Q y 661 FLSVFAWTILQCFLISTLFSRANLAAACGGIIYFTLYLPYVLCVAWQDYVGFTLKIFAS 720 

I I I I I I : I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | :: | | | | | 
Db 661 FLSVFAMVTI LQCFLI STLFSRAN1AAACGGI I YFTLYLPYVLCVAWQDYVGFS I KI FAS 720 

721 LLSPVAFGFGCEYFALFEEQGIGVQWDNLFESPVEEDGFNLTTSVSMMLFDTFLYGVMTW 780 



I II 1 1 Ml II INN I IIIIIII Mil I II llllllllhll III II Nil Ml II 

Db 721 LLSPVAFGFGCEYFALFEEQGIGVQWDNLFESPVEEDGFNLTTAVSMMLFDTFLYGVMTW 



780 



Q y 781 y^AVFPGQYGIPRPWYFPCTKSYWFGEESDEKSHPGSNQKRISEICMEEEPTHLKLGVS 840 



IMM I II I I II M M 



CI:. 781 YIEAVFPGQYGIPRPWYFPCTKSYWFGEEIDEKSHPGSSQK^ 



840 



0^ 841 IQNLVKyYRDGMKVA^ 900 

Db 841 iqnlvkvtrdgmkvavdglalnfyegqitsflghnU^ 900 

Qy 901 GKDIRSEMSTIRQNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEKHVKAEMEQMALDVG 960 

IIIIIIIMMIMIIMIIIMIIIIIinillllllllllMMIIIMIIIMII 

'MALDVG 96 0 



Db 901 GKDIRSEMSSIRQNLGVCPQHNVLFDMLTVEEHIWFYARLKGLS^ 



961 
961 



' " * ^^^^ ^J^^^ ^T^^^ ^"^^^ T*^^ 7 ^^ 31 X-j JI> ^* TT'-^^C^ ^^XZ> ^* ZC EI X-i JQj Kl tr. 102 0 



1020 



11 M M I I I I I f I II I i I I f I I I I H | [ ] | j | | [ j | | j | I 

LPPSKLKSKTSQLSGGMQRKLSVALAFVGGSKWILDEPTAGVDPYSRRGIWELLLKYRQ 

1021 GRTI ILSTHHMDEADVLGDRIAI I SHGKLCCVGS SLFLKNQLGTGYYLTLVKKDVESSLS 10 8 0 

I I I I I I I N I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | [ | | | | | | | | | | | | | | | | | 
1021 GRTI I LSTHHMDEADILGDRI AI I SHGKLCCVGS SLFLKNQLGTGYYLTLVKKDVESSLS 10 8 0 

1081 SCRNSSSTVSCLKKEDSVSQSSSDAGLGSDHE^ 114Q 

1141 ^X EI ZE_» ^tT^V" I_» X 3, "btf -^^^^ !K1 X^ T^X*T ZE_i tX IE JIZ> i^l X_i E3 X_» IX "^kTC^- HI :S X^T Tr" IEj HI Xj Kl'V"^?^ X€I EI "VTZU 1200 

1141 GHELTYVLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETT 120 0 

12 01 TSDGTLPARRNRRAFGDKQSCLRPFTEDDAADPNDSDIDPESRETDLLSGMDGKGS YOVK 12 6 0 

^ iUliniiilli 11 ! 1 11111 i mi " i in ii m Mi iii mi i in 1 1 ii i hi 

^ ^ x ± ^xr^xvi^r^r ^DKQSCLKPFTEuuAVupNuyulDPESRETDLLSGMDGKGSYQLK 12 6 0 
1261 GWKLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSLIVPPFGKYPSLELQ 1320 



Mlllllll 



II 



1261 GWKLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSLIVPPFGKYPSLELQ 1320 
13 21 PWMYNEQYTFVSNDAPEDTGTLELLNALTKDPGFGTRCMEGNPIPDTPCQAGEEEWTTAP 13 8 0 



1321 PWMYNEQYTFVSNDAPEDMGTQELLNALTKDPGFGTRCMEGNPIPDTPCLAGEEDWTISP 1380 
1381 VPQTIMDLFQNGNWTMQNPSPACQCSSDKIKKMLPVCPPGAGGLPPPQRKQNTADILQDL 1440 



1381 VPQSIVDLFQNGNWTMKNPSPACQCSSDKIKKMLPVCPPGAGGLPPPQRKQKTADILQNL 1440 
14 41 TGRNI SDYLVKTYVQI I AKSLKNKI WVNEFRYGGFSLGVSNTQALPPSQEVNDATKQMKK 15 00 



IIIIIMMIIIIIIIillMIMimilllllhllllll Mill INI 



1441 TGRNI SDYLVKTYVQ 1 1 AKSLKNKI WVNEFR YGGFSLGVSNSQALPPSHE VNDAI KQMKK 15 0 0 
1501 HLKLAKDS S ADRFLNS LGRFMTGLDTRNNVKVWFNNKGWHAI S S FLNVINNAI LRANLQK 1560 



in nniiiiniiiiii miMiiiiiinihiiiimiimiiii 



1501 LLKLTKDTSADRFLSSLGRFMAGLDTKNNVKWFNNKGWHAISSFLNviNNAILRAN 1560 
1561 GENPSHYGITAFNHPLNLTKQQLSEVAPMTTSVDVLVSICVIFAMSFVPASFWFLIQER 1620 



I II 



II 



1561 GENPSQYGITAFNHPLNLTKQQLSEVALMTTSVDVLVSICVIFAMSFVPASFVVFLIQER 1620 
1621 VSKAKHLQFISGVKPVIYWLSNFWDMCNYWPATLVIIIFICFQQKSYVSSTNLPVLAL 1680 



1621 VSKAKHLQFISGVKPVIYWLSNFVWDMCNYWPATLVIIIFICFQQKSYVSSTNLPVLAL 1680 
1681 LLLLYGWSITPLMYPASFVFKIPSTAYWLTSVNLFIGINGSVATFVLELFTDNKLNNIN 1740 



16 81 LLLLYGWS ITPLMYPASFVFKI PSTAYWLTSVNLFIGINGSVATFVLELFTNNKLNDIN 174 0 
1741 DILKSVFLIFPHFCLGRGLIDMVKNQAMADALERFGENRFVSPLSWDLVGRNLFAMAVEG 1800 



IIIMlllllllllllllllllllllllllllllllllllllllllllllllin 



1741 DILKSVFLIFPHFCLGRGLIDMVKNQAMADALERFGEN^ 1800 
1801 VVFFLITVLIQYRFFIRPRPVNAKLSPLNDEDEDVRRERQRILDGGGQNDILEIKELTKI 1860 



iiimiiiiiiiii in iniiiniiiiiiiiiiiiiiii 



1801 WFFLITVLIQYRFFIRPRPVKAKLPPLNDEDEDVRRERQRILDGGGQNDILEIKELTKI 1860 
1861 YRRKRKPAVDRICVGIPPGECFGLLGVNGAGKSSTFKMLTGDTTVTRGDAFLNRNSILSN 192 0 



18 61 YRRKRKPAVDRI C IG I PPGECFGLLGVNGAGKSTTFKMLTGDTPVTRGDAFLNKNSILSN 192 0 
1921 IHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPEKEVGKVGEWAIRKLGLVKYGEK 19 8 0 



n I I II I I I I II I II I I I I I I I I || I I I I | II | | I II I II I I I I II 



19 21 IHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPEKEVGKFGEWAIRKLGLVKYGEK 19 8 0 
1981 YAGNYSGGNKRKLSTAMALIGGPPWFLDEPTTGMDPKARRFLWNCALSWKEGRSWLT 2040 



Ml Mill II II III III III II II I II IIIMIIIIIIIII IMMMIMM 



1981 YASNYSGGNKRKLSTAMALIGGPPWFLDEPTTGMDPKARRFLWNCALSIVKEGRSWLT 2040 



wy 


2 041 


SHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNRFGDGYTIWRIAGSNPDLKPVODFFG 
SHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNRFGDGYTIWRIAGSNPDLKPVQEFFG 


2100 


Db 


2041 


2100 


Qy 


2101 


LAFPGSVPKEKHRNMLQYQLPSSLSSLARIFSILSQSKKRLHIEDYSVSQTTLDOVFVNF 

IIIIMI IIIIIIIIIMMIIIIIIIIMIIIIIMIIIIIIMM 

LAFPGSVLKEKHRNMLQYQLPSSLSSLARIFSILSQSKKRLHIEDYSVSQTTLDQVFVNF 


2160 


Db 


2101 


2160 


Qy 


2161 


AKDQSDDDHLKDLSLHKNQTVVDVAVLTSFLQDEKVKESYV 2201 

MMMIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIIII 

AKDQSDDDHLKDLSLHKNQTWDVAVLTSFLQDEKVKES YV 2 2 01 




Db 


2161 





SUMMARIES 

Result Query 



No . 


Score 


Match 


Length DB 


ID 




Description 


1 


11466 


100 


. 0 


2261 


1 


ABC1 


_HUMAN 


095477 


homo sapien 


2 


10906 


95 


.1 


2261 


1 


ABCl^ 


_MOUSE 


P41233 


mus musculu 


3 


5689.5 


49 


.6 


2273 


1 


ABCR~ 


_HUMAN 


P78363 


homo sapien 


4 


4131 


36 


. 0 


2436 


1 


ABC 2 


_HUMAN 


Q9bzc7 


homo sapien 


5 


3989.5 


34 


. 8 


2434 


1 


ABC2~ 


_MOUSE 


P41234 


mus musculu 


6 


2635.5 


23 


. 0 


1704 


1 


ABC3~ 


_HUMAN 


Q99758 


homo sapien 


7 


1528 . 5 


13 


.3 


1704 


1 


CED7~ 


_CAEEL 


P34358 


caenorhabdi 


8 


411 


3 , 


. 6 


330 


1 


drra" 


_STRPE 


P32010 


streptomyce 


9 


380.5 


3 , 


. 3 


343 


1 


NODI~ 


_RHISN 


P55476 


rhizobium s 


10 


366 


3 . 


, 2 


304 


1 


nodi] 


_RHIS3 


P72335 


rhizobium s 


11 


347 


3 . 


0 


308 


1 


YADG_ 


_ECOLI 


P36879 


escherichia 


12 


347 


3 . 


0 


335 


1 


NDI2_ 


_RHIME 


Q8gnh6 


rhizobium m 


13 


344 . 5 


3 . 


0 


340 


1 


nodi" 


_RHILO 


P23703 


rhizobium 1 


14 


343 . 5 


3 . 


0 


347 


1 


NODI_ 


_RHIGA 


P50332 


rhizobium g 


15 


335 . 5 


2 . 


9 


1280 


1 


MDR1_ 


_HUMAN 


P08183 


homo sapien 


16 


331.5 


2 . 


9 


355 


1 


NDI1_ 


RHIME 


052618 


rhizobium m 


17 


329.5 


2 . 


9 


578 


1 


YBHF_ 


ECOLI 


P75776 


escherichia 


18 


327 


2 . 


9 


894 


1 


YHIH_ 


ECOLI 


P37624 


escherichia 


19 


327 


2 . 


9 


1276 


1 


MDR3_ 


MOUSE 


P21447 


mus musculu 



RESULT 2 
ABCl_MOU3E 

ID ABCl_MOUSE STANDARD; PRT ; 2261 AA . 

AC P41233; 

DT 01-FEB-1995 (Rel . 31, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE ATP-binding cassette, sub-family A, member 1 (ATP-binding cassette 

DE transporter 1) (ATP-binding cassette 1) (ABC-1) . 

GN ABCA1 OR ABC1 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=DBA/2; TISSUE=Macrophage ; 

RX MEDLINE-94375008; PubMed=8 0 8 8 782 ; 

RA Luciani M.F., Denizot F., Savary s., Mattel M.-G., Chimini G.; 

RT "Cloning of two novel ABC transporters mapping on human chromosome 

RT 9 . " ; 

RL Genomics 21:150-159(1994). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C5 7BL/6 J ; 

RX MEDLINE-21251004; PubMed=11352567 ; 

RA Qiu Y., Cavelier L., Chiu S., Yang X., Rubin E., Cheng J.-F.; 

RT "Human and mouse ABCA1 comparative sequencing and transgenesis 



RT 

RL 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

KW 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 



studies revealing novel regulatory sequences. 11 ; 
Genomics 73:66-76(2001). 

-!- FUNCTION: CAMP - DE PENDENT AND SULFONYLUREA- SENSITIVE ANION 

TRANSPORTER. KEY GATEKEEPER INFLUENCING INTRACELLULAR CHOLESTEROL 

TRANSPORT (BY SIMILARITY) . 
-!- TISSUE SPECIFICITY: WIDELY EXPRESSED IN ADULT TISSUES . HIGHEST 

LEVELS ARE FOUND IN PREGNANT UTERUS AND UTERUS. 
-!- DOMAIN: MULTIFUNCTIONAL POLYPEPTIDE WITH TWO HOMOLOGOUS HALVES, 

EACH CONTAINING AN HYDROPHOBIC MEMBRANE -ANCHORING DOMAIN AND AN 

ATP BINDING CASSETTE (ABC) DOMAIN. 
-!- PTM: Phosphorylation on Ser-2054 regulates phospholipid efflux (By 

similarity) . 

-!- SIMILARITY: BELONGS TO THE ABC TRANSPORTER FAMILY . ABC A SUBFAMILY. 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
US ^ by ^ non_p ^ ofit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to licenseOisb- sib . ch) . 

EMBL; X7592 6; CAA5353 0.1; ALT_INIT . 
EMBL; AF2 8 7263; AAG3 90 73.1; ALT_INIT . 
MGD; MGI : 9960 7 ; Abcal . 

GO; GO:0008203; P : cholesterol metabolism; IDA. 

GO; GO:0030301; P : cholesterol transport; IDA. 

InterPro; IPR0 03 5 93; AAA_ATPase. 

InterPro; IPR00343 9; ABC^transporter . 

Pfam; PF00005; ABC_tran; 2. 

ProDom; PD000006; ABC_transporter ; 2. 

SMART; SM00382; AAA; 2. 

PROSITE; PS00211; ABC_TRANSP0RTER_1 ; 1. 
PROSITE; PS50893; ABCJTRANS PORTER 2; 2. 
ATP-binding 
TRANSMEM 
TRANSMEM 
TRANSMEM 
TRANSMEM 
TRANSMEM 
TRANSMEM 
TRANSMEM 
TRANSMEM 
TRANSMEM 
TRANSMEM 
TRANSMEM 
TRANSMEM 
TRANSMEM 
NP_BIND 
NP_BIND 
MOD RES 



Glycoprotein; Transmembrane; Transport; Phosphorylation. 



26 
640 
690 
717 
749 
771 
1041 
1351 
1661 
1708 
1737 
1775 
1854 
933 
1946 
1042 



MOD_RES 

CARBOHYD 
CARBOHYD 
CARBOHYD 
CARBOHYD 
CARBOHYD 
CARBOHYD 
CARBOHYD 
CARBOHYD 
CARBOHYD 
CARBOHYD 
CARBOHYD 
CARBOHYD 
CARBOHYD 
CARBOHYD 
CARBOHYD 
CARBOHYD 
CARBOHYD 



2054 



42 
656 
706 
733 
765 
787 
1057 
1367 
1677 
1724 
1753 
1791 
1870 
940 
1953 
1042 

2054 



POTENTIAL. 
POTENTIAL . 
POTENTIAL . 
POTENTIAL . 
POTENTIAL . 
POTENTIAL. 
POTENTIAL. 
POTENTIAL . 
POTENTIAL . 
POTENTIAL . 
POTENTIAL . 
POTENTIAL. 
POTENTIAL. 
ATP (POTENTIAL) . 
ATP (POTENTIAL) . 
PHOS PHORYLATION 
SIMILARITY) . 
PHOSPHORYLATION 
SIMILARITY) . 



(BY PKA) 
(BY PKA) 



(MAJOR) (BY 



(MAJOR) (BY 



14 


14 


N 


- LINKED 


(GLCNAC. . 


.) (POTENTIAL) 


98 


98 


N 


-LINKED 


(GLCNAC. . 


.) (POTENTIAL) 


151 


151 


N 


-LINKED 


(GLCNAC. . 


.) (POTENTIAL) 


161 


161 


N- 


-LINKED 


(GLCNAC. . 


. ) (POTENTIAL) 


196 


196 


N- 


-LINKED 


(GLCNAC. . 


. ) (POTENTIAL) 


244 


244 


N- 


- LINKED 


(GLCNAC. . 


. ) (POTENTIAL) 


292 


292 


N- 


-LINKED 


(GLCNAC . . 


.) (POTENTIAL) 


337 


337 


N- 


-LINKED 


(GLCNAC. . 


.) (POTENTIAL) 


349 


349 


N- 


LINKED 


(GLCNAC . . 


.) (POTENTIAL) 


400 


400 


N- 


LINKED 


(GLCNAC. . 


.) (POTENTIAL) 


478 


478 


N- 


LINKED 


(GLCNAC. . 


.) (POTENTIAL) 


489 


489 


N- 


LINKED 


(GLCNAC. . 


.) (POTENTIAL) 


521 


521 


N- 


LINKED 


(GLCNAC. . 


.) (POTENTIAL) 


820 


820 


N- 


LINKED 


(GLCNAC. . 


.) (POTENTIAL) 


1144 


1144 


N- 


LINKED 


(GLCNAC. . 


.) (POTENTIAL) 


1294 


1294 


N- 


LINKED 


(GLCNAC. . 


.) (POTENTIAL) 


1453 


1453 


N- 


LINKED 


(GLCNAC. . 


.) (POTENTIAL) 



FT CARBOHYD 1499 1499 N-LINKED (GLCNAC. . .) (POTENTIAL) 

FT CARBOHYD 1504 1504 N-LINKED (GLCNAC. . .) (POTENTIAL) ' 

FT CARBOHYD 1637 1637 N-LINKED (GLCNAC. . .) (POTENTIAL) '. 

FT CARBOHYD 2044 2044 N-LINKED (GLCNAC. . .) ( POTENTIAL) 

FT CARBOHYD 2238 2238 N-LINKED (GLCNAC. . .) (POTENTIAL) 

FT CONFLICT 1567 1568 MISSING (INREF. 2), 

FT CONFLICT 2024 2024 MISSING (IN REF . 2) . 

SQ SEQUENCE 2261 AA; 254011 MW; FAE62B21FD1D09F9 CRC64 ; 

Query Match 95.1%; Score 10906; DB 1; Length 2261; 

Best Local Similarity 94.8%; Pred. No . 0 ; 

Matches 2 0 87; Conservative 54; Mismatches 60; Indels 0; Gaps 0 

Qy 1 MPSAGTLPWVQGIICNANNPCFRYPTPGEAPGWGNFNKSI VARLFSDARRLLLYSQKDT 60 

lll!IMIIMIIII||||||||l!l!||||||||||||||h||||||:|||||||:|| 
Db 61 MPSAGTLPWVQGIICNANNPCFRYPTPGEAPGWGNFNKSIVSRLFSDAQRLLLYSQRDT 120 

2^ 61 SMKDMRKVLRTLQQIKKSSSNLKLQDFLVDNETFSGFLYHNLSLPKSTVDKMLRADVILH 120 

' : i ' i 1 1 I I I : i i i : i i i i i II M I M M I i I i i I II M I M II I : I : -I I 

Db 121 SIKDMHKVLRMLRQIKHPNSNLKLQDFLVDNETFSGFLQHNLSLPRSTVDSLLQXNVGLQ 180 

Qy 121 KVFLQGYQLHLTSLCNGSKSEEMIQLGDQEVSELCGLPREKLAAAERVLRSNMDILKPIL 180 

MIMIIIIM 1 1 1 1 1 II I hi 1 1 II III II MINI M MM I II III II- 

Db 181 KVFLQGYQLHLASLCNGSKLEEIIQLGDAE VSALCGLPRKKLDAAERVLRYNMDILKPW 240 

Qy 181 RTLNSTSPFPSKELAEATKTLLHSLGTLAQELFSMRSWSDMRQEVMFLTNVNSSSSSTQI 240 

HI I I I- Mill M III I III Ml MMMI Ml MINIM II Ml II 
Db 24i TKLNSTSHLPTQHLAEATTVLLDSLGGLAQELFSTKSWSDMRQEVMFLTNVNSSSSSTQI 300 

Qy 241 YQAVSRI VCGHPEGGGLKIKSLNWYEDNNYKALFGGNGTEEDAETFYDNSTTPYCNDLMK 300 

I i I ' M f M 1 1 1 If 1 1 1 1 1 1 1 1 M ! 1 1 1 1 1 1 1 1 M I f MM MIIIIIIIMIMIM 

Db 301 YQAVSRI VCGHPEGGGLKIKSLNWYEDNNYKALFGGNNTEEDVDTFYDNSTTPYCNDLMK 360 

QY 3 01 NLESSPLSRIIWKALKPLLVGKILYTPDTPATRQVMAEVNKTFQELAVFHDLEGMWEELS 36 0 

ill III I II I II II I I Ml MM I Ml III III III II I MIMIIIIM I III' 

Db 361 NLESSPLSRIIWKALKPLLVGKILYTPDTPATRQVMAEVNKTFQELAVFHDLEGMWEELS 420 

Qy 361 PKIWTFMENSQEMDLVRMLLDSRDNDHFWEQQLDGLDWTAQDIVAFLAKHPEDVQSSNGS 420 

! ; l Ml I II I II II III Mill II IIIIMIIIIIIIMIMIIIIMIIIII Ml 

Db 421 PQIWTFMENSQEMDLVRTLLDSRGNDQFWEQKLDGLDWTAQDIMAFLAKNPEDVQSPNGS 4 80 

Qy 421 VYTWREAFNETNQAIRTISRFMECVNLNKLEPIATEVWLINKSMELLDERKFWAGIVFTG 480 

I I N II I I I I I I I I I M II I I M I I I I I II | | | || | | | || M II II I I | || I I 

Db 481 VYTWREAFNETNQAIQTISRFMECVNLNKLEPIPTEVRLINKSMELLDERKFWAGIVFTG 54 0 

Qy 481 ITp GSIELPHHVKYKIRMDIDNVERTNKIKTJGYWDPGPRADPFEDMRYVWGGFAYLQDW 540 

IN 1 : 1 1 1 1 1 1 M II M I II I II I || M I II I I II I | | | || || M I I II I I II I II || | 
Db 541 ITPDSVELPHHVKYKIRMDIDNVERTNKIKDGYWDPGPRADPFEDMRYVWGGFAYLQDW 600 

Qy 541 EQAIIRVLTGTEKKTGVYMQQMPYPCYVDDIFLRVMSRSMPLFMTLAWIYSVAVIIKGIV 600 

IIIIMIMhlMMMMIMI M I M II I I I I II M I II I I M I I II M II 

Db 601 EQAIIRVLTGSEKKTGVYVQQMPYPCYVDDIFLRVMSRSMPLFMTLAWIYSVAVIIKSIV 660 

Qy 601 YEKEARLKETMRIMGLDNSILWFSWFISSLIPLLVSAGLLWILKLGNLLPYSDPSWFV 660 

. c 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I I M II 1 1 1 II I II II 1 1 1 II I II I II I II 1 1 II 1 1 

Db 661 yEKEARLKETMRIMGLDNGILWFSWFVSSLIPLLVSAGLLVVILKLGNLLPYSDPSVVFV 720 

QY 66 1 FLSVFAVVTILQCFLISTLFSRANLAAACGGIIYFTLYLPYVLCVAWQDYVGFTLKIFAS 720 

1 1 1 1 1 1 : 1 1 1 N I M II M I II I | | || || II I I II I II I I I II II I I M- M I II I 

Db 72i FLS VFAMVTI LQCFLISTLFSRANLAAACGGI I YFTLYLPYVLCVAWQDYVGFS IKI FAS 780 

QY 721 LLSPVAFGFGCEYFALFEEQGIGVQWDNLFESPVEEDGFNLTTSVSMMLFDTFLYGVMTW 780 

I I I i 1 1 1 1 1 I I I I M II II II || I II I I II II I I II I I I II II M II I M I I II II II I I 

Db 78i LLSPVAFGFGCEYFALFEEQGIGVQWDNLFESPVEEDGFNLTTAVSMMLFDTFLYGVMTW 840 

Qy 781 yiEAVFPGQYGIPRPWYFPCTKSYWFGEESDEKSHPGSNQKRISEICMEEEPTHLKLGVS 840 

1 1 I I I I M M I II I II I I M I II I II I II I II II II M I I M II I II II I II I M II I 
Db 841 YIEAVFPGQYGIPRPWYFPCTKSYWFGEEIDEKSHPGSSQKGVSEICMEEEPTHLRLGVS 900 

Qy 841 I Q NLV KVYRDGMKVAVDGLALNFYEGQITSFLGHNGAGKTTTMSILTGLFPPTSGTAYIL 900 

MM Ml Ml II I III IMMI I Ml MM I Ml III MMM MUM I MM M MM 



Db 901 IQNLVKVYRDGMKVAVDGLALNFYEGQITSFLGHNGAGKTTTMSILTGLFPPTSGTAYIL 960 

Qy 901 GKDIRSEMSTIRQNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEKHVKAEMEQMALDVG 96 0 

I I I I I I I I I : I I I I I I I I I I I I I .1 I I I I I I I I I I I I I I I I I I I I I I I I I | | | | M I I I I I 

Db 961 GKD1RSEMSSIRQNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEKHVKAEMEQMALDVG 1020 

Qy 961 LPSSKLKSKTSQLSGGMQRKLSVALAFVGGSKWILDEPTAGVDPYSRRGIWELLLKYRQ 1020 

II i M 1 1 1 Illlllllllllllllll Illlllllllllllllll 

Db 1021 LPPSKLKSKTSQLSGGMQRKLSVALAFVGGSKWILDEPTAGVDPYSRRGIWELLLKYRQ 1080 

Qy 1021 GRTIILSTHHMDEADVLGDRIAIISHGKLCCVGSSLFLKNQLGTGYYLTLVKKDVESSLS 1080 

Db 1081 GRTIILSTHHMDEADILGDRIAIISHGKLCCVGSSLFLKNQLGTGYYLTLVKKDVESSLS 1140 

Qy 1081 SCRNSSSTVSYLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNLIRKHVSEARLVEDI 1140 

MMIIIIII 

Db 1141 SCRNSSSTVSCLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNLIRKHVSEARLVEDI 1200 

Qy 1141 GKELTYVLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETTLEEIFLKVAEESGVDAE 12 0 0 

Db 1201 GHELTYVLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETTLEEIFLKVAEESGVDAE 1260 

Qy 1201 TSDGTLPARRNRRAFGDKQSCLRPFTEDDAADPNDSDIDPESRETDLLSGMDGKGSYQVK 1260 

miiiMiiiiiiiiiiiiii iiiiiii 

Db 1261 TSDGTLPARRNRRAFGDKQSCLHPFTEDDAVDPNDSDIDPESRETDLLSGMDGKGSYQLK 1320 

Qy 1261 GWKLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSLIVPPFGKYPSLELQ 13 2 0 

I I M HI 1 1 1 1 1 1 M 1 1 1 i 1 1 i i 1 1 1 1 1 1 II 1 1 II I 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1321 GWKLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSLIVPPFGKYPSLELQ 1380 

Qy 1321 PWMYNEQYTFVSNDAPEDTGTLELLNALTKDPGFGTRCMEGNPIPDTPCQAGEEEWTTAP 1380 

II M Ml I IMIMI III II IINIIIIIIIIIIIIIIIIIIIIIII Nihil 

Db 13 81 PWMYNEQYTFVSNDAPEDMGTQELLNALTKDPGFGTRCMEGNPIPDTPCLAGEEDWTISP 144 0 

Qy 1381 VPQTIMDLFQNGNWTMQNPSPACQCSSDKIKKMLPVCPPGAGGLPPPQRKQNTADILQDL 1440 

1 1 I : I : 1 1 1 1 1 1 ! 1 1 1 : 1 1 1 1 !l 1 1 1 1 1 1 1 1 II 1 1 1 | ! | M 1 1 1 1 1 1 1 [ I ! II II I IN 

Db 1441 VPQSIVDLFQNGNWTMKNPSPACQCSSDKIKKMLPVCPPGAGGLPPPQRKQKTADILQNL 1500 

Qy 1441 TGRNISDYLVKTYVQIIAKSLKNKIWVNEFRYGGFSLGVSNTQALPPSQEVNDATKQMKK 1500 

ii mm inn mi mi i rim 1 1 ii iii ii ii ii in N ii i inn inn 

Db 1501 TGRNISDYLVKTYVQIIAKSLKNKIWVNEFRYGGFSLGVSNSQALPPSHEVNDAIKQMKK 1560 

Qy 1501 HLKLAKDSSADRFLNSLGRFMTGLDTRNNVKVWFNNKGWHAISSFLNVINNAILRANLQK 1560 

III I IN INI IN INN 

Db 1561 LLKLTKDTSADRFLSSLGRFMAGLDTKNNVKVWFNNKGWHAISSFLNVINNAILRANLQK 162 0 

Qy 15 61 GENPSHYGITAFNHPLNLTKQQLSEVAPMTTS VDVLVS I CVI FAMS FVPASFWFLIQER 162 0 

INN 1 1 1 1 1 1 1 1 1 1 1 M 1 1 > I M 1 1 

Db 1621 GENPSQYGITAFNHPLNLTKQQLSEVALMTTSVDVLVSICVIFAMSFVPASFWFLIQER 168 0 

Qy 1621 VSKAKHLQFISGVKPVIYWLSNFVWDMCNYWPATLVIIIFICFQQKSYVSSTNLPVLAL 168 0 

lllllllllllll i I M 1 1 1 1 1 1 1 M I M I i I ! I ! f 1 1 1 [ 1 1 M I i M 1 1 M 1 1 1 1 ! 1 1 1 

Db 16 81 VSKAKHLQFISGVKPVIYWLSNFVWDMCNYWPATLVIIIFICFQQKSYVSSTNLPVLAL 174 0 

Qy 16 81 LLLLYGWSITPLMYPASFVFKIPSTAYWLTSWLFIGINGSVATFVLELFTDNKLNNIN 1740 

Db 1741 LLLLYGWSITPLMYPASFVFKIPSTAYWLTSVNLFIGINGSVATFVLELFTNNKLNDIN 1800 

Qy 1741 DILKSVFLIFPHFCLGRGLIDMVKNQAMADALERFGENRFVSPLSWDLVGRNLFAMAVEG 18 00 

IN N 1 1 llll N IN lllllllllllll IN INN III llll INN INI II INN 

Db 18 01 D1LKSVFLIFPHFCLGRGLIDMVKNQAMADALERFGENRFVSPLSWDLVGRNLFAMAVEG 1860 

Qy 1801 WFFLITVLIQYRFFIRPRPVNAKLSPLNDEDEDVRRERQRILDGGGQNDILEIKELTKI 1860 

IIIINIIIINIIIINIII III IIIINIIIININIIIIINNIINIIIIII 

Db 1861 WFFLITVLIQYRFFIRPRPVKAKLPPLNDEDEDVRRERQRILDGGGQNDILEIKELTKI 1920 

Qy 1861 YRRKRKPAVDRICVGIPPGECFGLLGVNGAGKSSTFKMLTGDTTVTRGDAFLNRNSILSN 1920 

IIIIIIIIIINININIIINNINIIIIININIIIN 

Db 1921 YRRKRKPAVDRICIGIPPGECFGLLGVNGAGKSTTFKMLTGDTPVTRGDAFLNKNSILSN 1980 

Qy 1921 IHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPEKEVGKVGEWAIRKLGLVKYGEK 198 0 



Db 


1981 


Qy 


1981 


Db 


2041 


Qy 


2041 


Db 


2101 


Qy 


2101 


Db 


2161 


Qy 


2161 


Db 


2221 



: I III i I I . I II : I ! II ill 

IHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPEKEVGKFGEWAIRKLGLVKYGEK 2 04 0 
YAGNYSGGNKRKLSTAMALIGGPPWFLDEPTTGMDPKARRFLWNCALSVVKEGRSWLT 2 04 0 

II lllllllllllllllllllMIMIMIIIIIMMIIIMIlMhllMIIMM 

YASNYSGGNKRKLSTAMALIGGPPWFLDEPTTGMDPKARRFLWNCALSIVKEGRSWLT 210 0 
SHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNRFGDGYTIWRIAGSNPDLKPVQDFFG 210 0 

IIIIIIIIIIIIIIIMIIlMIIillllllllllllMIIIIIIIIIIIMIIIhlM 

SHSMEECEALCTRMAIMWGRFRCLGSVQHLKNRFGDGYTIWRIAGSNPDLKPVQEFFG 2160 
LAFPGSVPKEKHRNMLQYQLPSSLSSLARIFSILSQSKKRLHIEDYSVSQTTLDQVFVNF 2160 

lllllll IIIIIIIIIIIIIIIIIIIIIMIIIMIIIMIIIIIIIIIIIinillll 

LAFPGSVLKEKHRNMLQYQLPSSLSSLARIFSILSQSKKRLHIEDYSVSQTTLDQVFVNF 222 0 
AKDQSDDDHLKDLSLHKNQTWDVAVLTSFLQDEKVKESYV 22 01 

IIIIIIIIIIIMIIIMIMIIIIIIMIMIMNIMI 

AKDQSDDDHLKDLSLHKNQTWDVAVLTSFLQDEKVKESYV 2 2 61 



